library(biomaRt)
library(DESeq2)
library(tidyverse)

Before starting this section, we will make sure we have all the relevant objects from the Differential Expression analysis.

load("Robjects/DE.Rdata")

Overview

Adding annotation to the DESeq2 results

We have a list of significantly differentially expressed genes, but the only annotation we can see is the Ensembl Gene ID, which is not very informative.

There are a number of ways to add annotation. One method is to use the org.Mm.eg.db package. This package is one of several organism-level packages which are re-built every 6 months. These packages are listed on the annotation section of the Bioconductor website, and are installed in the same way as regular Bioconductor packages.

An alternative approach is to use biomaRt, an interface to the BioMart resource. This is the method we will use today.

Select BioMart database and dataset

The first step is to select the BioMart database we are going to access and the dataset we are going to use.

# view the available databases
listMarts()
## set up connection to ensembl database
ensembl=useMart("ENSEMBL_MART_ENSEMBL")
# list the available datasets (species)
listDatasets(ensembl) %>% 
    filter(str_detect(description, "Mouse"))
# specify a data set to use
ensembl = useDataset("mmusculus_gene_ensembl", mart=ensembl)

Query the database

Now we need to set up a query. For this we need to specify three things:

  1. What type of information we are going to search the dataset on - called filters. In our case this is Ensembl Gene IDs
  2. A vector of the values for our filter - the Ensembl Gene IDs from our DE results table
  3. What columns (attributes) of the dataset we want returned.

Returning data from BioMart can take time, so it’s always a good idea to test your query on a small list of values first to make sure it is doing what you want. We’ll just use the first 1000 genes for now.

# check the available "filters" - things you can filter for
listFilters(ensembl) %>% 
    filter(str_detect(name, "ensembl"))
# Set the filter type and values
filterType <- "ensembl_gene_id"
filterValues <- rownames(resLvV)[1:1000]
# check the available "attributes" - things you can retrieve
listAttributes(ensembl) %>% 
    head(20)
# Set the list of attributes
attributeNames <- c('ensembl_gene_id', 'entrezgene', 'external_gene_name')
# run the query
annot <- getBM(attributes=attributeNames, 
               filters = filterType, 
               values = filterValues, 
               mart = ensembl)

Batch submitting query [====================>----------]  67% eta:  1s
Batch submitting query [===============================] 100% eta:  0s
                                                                      

One-to-many relationships

Let’s inspect the annotation.

head(annot)
dim(annot) # why are there more than 1000 rows?
[1] 1001    3
length(unique(annot$ensembl_gene_id)) # why are there fewer than 1000 Gene ids?
[1] 999
isDup <- duplicated(annot$ensembl_gene_id)
dup <- annot$ensembl_gene_id[isDup]
annot[annot$ensembl_gene_id%in%dup,]

There are a couple of genes that have multiple entries in the retrieved annotation. This is because there are multiple Entrez IDs for a single Ensembl gene. These one-to-many relationships come up frequently in genomic databases; it is important to be aware of them and check when necessary.

We will need to do a little work before adding the annotation to our results table. We could decide to discard one or both of the Entrez ID mappings, or we could concatenate the Entrez IDs so that we don’t lose information.
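As a minimal sketch of the concatenation option, the toy table below joins multiple Entrez IDs for one gene into a single semicolon-separated string using base R. The gene IDs, Entrez IDs and the helper name collapse_ids are made up for illustration; this is not necessarily how the annotation table used later in this section was prepared.

```r
# Toy annotation table: the IDs are invented for illustration only
annot_toy <- data.frame(
    ensembl_gene_id = c("ENSMUSG_A", "ENSMUSG_B", "ENSMUSG_B", "ENSMUSG_C"),
    entrezgene = c(101, 202, 203, NA),
    stringsAsFactors = FALSE
)

# Hypothetical helper: join the unique, non-missing IDs into one string
collapse_ids <- function(x) {
    x <- unique(x[!is.na(x)])
    if (length(x) == 0) NA_character_ else paste(x, collapse = ";")
}

# One entry per Ensembl gene
entrez_by_gene <- tapply(annot_toy$entrezgene,
                         annot_toy$ensembl_gene_id,
                         collapse_ids)
annot_collapsed <- data.frame(
    ensembl_gene_id = names(entrez_by_gene),
    entrez = as.vector(entrez_by_gene),
    stringsAsFactors = FALSE
)
annot_collapsed
```

The result has exactly one row per Ensembl gene, so joining it to a results table cannot introduce duplicated rows.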

Retrieve full annotation

Challenge

That was just 1000 genes. We need annotations for the entire results table. Also, there may be some other interesting columns in BioMart that we wish to retrieve.

  1. Search the attributes and add the following to our list of attributes:
    1. The gene description
    2. The genomic position - chromosome, start, end, and strand (4 columns)
    3. The gene biotype
  2. Query BioMart using all of the genes in our results table (resLvV)
  3. How many Ensembl genes have multiple Entrez IDs associated with them?
  4. How many Ensembl genes in resLvV don’t have any annotation? Why is this?
# filterValues <- rownames(resLvV)
# 
# # check the available "attributes" - things you can retrieve
# listAttributes(ensembl) %>%
#     head(20)
# attributeNames <- c('ensembl_gene_id', 
#                     'entrezgene',
#                     'external_gene_name',
#                     'description',
#                     'gene_biotype',
#                     'chromosome_name',
#                     'start_position',
#                     'end_position',
#                     'strand')
# 
# # run the query
# annot <- getBM(attributes=attributeNames,
#                filters = filterType,
#                values = filterValues,
#                mart = ensembl)
# 
# sum(duplicated(annot$ensembl_gene_id))
# missingGenes <- !rownames(resLvV)%in%annot$ensembl_gene_id
# rownames(resLvV)[missingGenes]

Add annotation to the results table

We can now add the annotation to the results table and then save the results using the write_tsv function, which writes the results out to a tab-separated file. To save time we have created an annotation table in which we have modified the cumbersome BioMart column names, and dealt with the one-to-many issues for Entrez IDs.

ensemblAnnot <- read_tsv("data/Ensembl_annotations.tsv")
colnames(ensemblAnnot)
[1] "GeneID"      "Entrez"      "Symbol"      "Description" "Biotype"    
[6] "Chr"         "Start"       "End"         "Strand"     
resTab <- as.data.frame(resLvV) %>% 
    rownames_to_column("GeneID") %>% 
    left_join(ensemblAnnot, "GeneID") %>% 
    rename(logFC=log2FoldChange, FDR=padj)

Finally we can output the annotated DE results using write_tsv.

write_tsv(resTab, "results/VirginVsLactating_Results_Annotated.txt")

Challenge

Have a look at the gene symbols for the most significant genes by adjusted p-value. Do they make biological sense in the context of comparing gene expression in mammary gland tissue between lactating and virgin mice? You may want to do a quick web search of your favourite gene/protein database.

Visualisation

DESeq2 provides a function called lfcShrink that shrinks log-Fold Change (LFC) estimates towards zero using an empirical Bayes procedure. The reason for doing this is that there is high variance in the LFC estimates when counts are low, and this results in lowly expressed genes appearing to show greater differences between groups than highly expressed genes. The lfcShrink method compensates for this and allows better visualisation and ranking of genes. We will use it for our visualisations of the data.

ddsShrink <- lfcShrink(ddsObj, coef="Status_lactate_vs_virgin")
resTab <- ddsShrink %>% 
    as.data.frame() %>% 
    rownames_to_column("GeneID") %>% 
    left_join(ensemblAnnot, "GeneID") %>% 
    rename(logFC=log2FoldChange, FDR=padj)
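To see why low counts inflate LFC estimates, here is a small simulation sketch in base R. It draws pairs of Poisson counts with identical means (so the true LFC is zero) at a low and a high expression level and compares the spread of the resulting log2 ratios. The Poisson model, the pseudocount of 0.5 and the function name simulate_lfc are assumptions for illustration; this shows the problem only, not what lfcShrink does internally.

```r
set.seed(42)
n_genes <- 2000

# Hypothetical helper: simulate log2 fold changes between two groups
# that share the same true mean count (true LFC = 0)
simulate_lfc <- function(mean_count) {
    a <- rpois(n_genes, mean_count)
    b <- rpois(n_genes, mean_count)
    log2((a + 0.5) / (b + 0.5))  # pseudocount avoids division by zero
}

lfc_low <- simulate_lfc(5)     # lowly expressed genes
lfc_high <- simulate_lfc(500)  # highly expressed genes

# Spread of the estimates around the true value of zero
sd(lfc_low)
sd(lfc_high)
```

Even though no gene is truly differentially expressed, the lowly expressed genes show a much wider spread of apparent fold changes, which is the noise that shrinkage is designed to damp down.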

P-value histogram

A quick and easy “sanity check” for our DE results is to generate a p-value histogram. What we should see is a high bar in the 0 - 0.05 bin and then a roughly uniform tail to the right of this. There is a nice explanation of other possible patterns in the histogram and what to do when you see them in this post.

hist(resTab$pvalue)
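The expected shape can be reproduced with simulated p-values. In this sketch we assume, purely for illustration, that 80% of genes are null (uniform p-values) and 20% are truly changing (p-values drawn from a Beta distribution concentrated near zero); the proportions and the Beta parameters are arbitrary choices.

```r
set.seed(1)
p_null <- runif(8000)          # null genes: uniform p-values
p_alt <- rbeta(2000, 0.2, 5)   # changing genes: p-values piled up near 0
p_sim <- c(p_null, p_alt)

# Count p-values in bins of width 0.05 without drawing the plot
h <- hist(p_sim, breaks = seq(0, 1, by = 0.05), plot = FALSE)
h$counts

# hist(p_sim, breaks = seq(0, 1, by = 0.05))  # uncomment to draw it
```

The first bin holds the signal plus its share of null genes, and the remaining bins are roughly flat, which is the pattern we hope to see in our own results.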

MA plots

MA plots are a common way to visualize the results of a differential analysis. We met them briefly towards the end of Session 2. This plot shows the log-Fold Change for each gene against its average expression across all samples in the two conditions being contrasted.

DESeq2 has a handy function for plotting this…

plotMA(ddsShrink, alpha=0.05)

…this is fine for a quick look, but it is not easy to make changes to the way it looks or add things such as gene labels. Perhaps we would like to add labels for the top 10 most significantly differentially expressed genes. Let’s use ggplot2 instead.

# add a column with the names of only the top 10 genes
cutoff <- sort(resTab$pvalue)[10]
resTab <- resTab %>% 
    mutate(TopGeneLabel=ifelse(pvalue<=cutoff, Symbol, ""))
ggplot(resTab, aes(x = log2(baseMean), y=logFC)) + 
    geom_point(aes(colour=FDR < 0.05), pch=20, size=0.5) +
    geom_text(aes(label=TopGeneLabel)) +
    labs(x="log2(mean of normalised counts)", y="log fold change")

Volcano plot

Another common visualisation is the volcano plot, which displays a measure of significance on the y-axis and fold-change on the x-axis. In this case we use the log2 fold change (logFC) on the x-axis, and on the y-axis we’ll use -log10(FDR). This -log10 transformation is commonly used for p-values as it means that more significant genes have a higher value. We should first remove the genes that were excluded by the independent filtering process of DESeq2.

# first remove the filtered genes (FDR=NA) and create a -log10(FDR) column
filtTab <- resTab %>% 
    filter(!is.na(FDR)) %>% 
    mutate(`-log10(FDR)` = -log10(FDR))
ggplot(filtTab, aes(x = logFC, y=`-log10(FDR)`)) + 
    geom_point(aes(colour=FDR < 0.05), size=2)

We could limit the values at the top of the plot so that we can see the lower portion more clearly.

filtTab <- filtTab %>% 
    mutate(`-log10(FDR)`=pmin(`-log10(FDR)`, 51))
ggplot(filtTab, aes(x = logFC, y=`-log10(FDR)`)) + 
    geom_point(aes(colour=FDR < 0.05, shape = `-log10(FDR)` > 50), size=2)
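To make the transformation and the cap concrete, here is a short worked example on a handful of made-up FDR values (the values and variable names are for illustration only).

```r
# Made-up FDR values, from not significant to extremely significant
fdr <- c(0.5, 0.05, 1e-10, 1e-80)

# Smaller FDR gives a larger value on the y-axis
neg_log <- -log10(fdr)
neg_log

# Cap the values at 51 so that extreme genes do not stretch the axis
capped <- pmin(neg_log, 51)
capped

# Capped points can still be recognised because they exceed 50
capped > 50
```

Only the last value is changed by the cap, and it is the only one flagged by the `> 50` test, which is what the shape aesthetic in the plot above relies on.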

Strip chart for gene expression

Before following up on the DE genes with further lab work, a recommended sanity check is to have a look at the expression levels of the individual samples for the genes of interest. We can quickly look at grouped expression using stripchart. We can retrieve the normalised expression values in the ddsObj object using the counts function from DESeq2.

normCounts <- counts(ddsObj, normalized=TRUE) %>% 
    log2()
# Let's look at the most significantly differentially expressed gene: Wap
topgene <- filter(resTab, Symbol=="Wap")
topgene
groups <- colData(ddsObj)$Group
par(mar=c(8,4,2,2)) # adjust the plot margins so that the x-labels are visible - see ?par
stripchart(normCounts["ENSMUSG00000000381",]~groups,
           col=1:6,
           vertical=TRUE,
           pch=21,
           las=2,
           cex=2,
           xlab="",
           ylab="log2(Counts)",
           main="Normalised Counts - Wap")

Interactive StripChart with Glimma

An interactive version of the volcano plot above that includes the raw per sample values in a separate panel is possible via the glXYPlot function in the Glimma package.

library(Glimma)
group <- as.factor(sampleinfo$Group)
levels(group) <- c("basal.lact","basal.preg","basal.vir",
                   "lum.lact", "lum.preg", "lum.vir")
annot.mod <- filtTab[,c("GeneID", "Symbol", "Description")]
de <- as.numeric(filtTab$FDR<=0.05)
filtCounts <- normCounts[filtTab$GeneID,]
glXYPlot(x=filtTab$logFC, y=-log10(filtTab$FDR),
         xlab="logFC", ylab="-log10(FDR)", main="Lactating v Virgin",
         counts=filtCounts, groups=group, status=de,
         anno=annot.mod, id.column="GeneID", folder="volcano")

This function creates an html page (./volcano/XY-Plot.html) with a volcano plot on the left and a plot showing the per-sample expression values (the log2 normalised counts we supplied) for a selected gene on the right. A search bar is available to search for genes of interest.

Additional Material

Retrieving Detailed Genomic Locations

There is a whole suite of annotation packages that can be used to access this information, and for performing more advanced queries that relate to the location of genes. These are listed on the Bioconductor annotation page and have the prefix TxDb. (where “tx” is “transcript”). In addition there are a large number of packages that make use of these annotations for downstream analyses and visualisations.

Unfortunately, these packages do not cover all species and tend only to be available for UCSC genomes. Thankfully, there is a way to build your own database from either a GTF file or from various online resources such as BioMart using the package GenomicFeatures.

library(GenomicFeatures)
txMm <- makeTxDbFromBiomart(dataset="mmusculus_gene_ensembl")
# 
# makeTxDbPackageFromBiomart(version="0.99.0",
#                            maintainer="Some One <so@someplace.org>",
#                            author="Some One <so@someplace.com>",
#                            dataset="mmusculus_gene_ensembl")
# library(TxDb.Mmusculus.BioMart.ENSEMBLMARTENSEMBL.GRCm38.p6)
# txMm <- TxDb.Mmusculus.BioMart.ENSEMBLMARTENSEMBL.GRCm38.p6

Accessing the information in these TxDb databases is similar to the way in which we accessed information using biomaRt except that filters (the information we are filtering on) are now called keys and attributes (things we want to retrieve) are columns.

First we need to decide what information we want. In order to see what we can extract we can run the columns function on the annotation database.

columns(txMm)
 [1] "CDSCHROM"   "CDSEND"     "CDSID"      "CDSNAME"    "CDSPHASE"  
 [6] "CDSSTART"   "CDSSTRAND"  "EXONCHROM"  "EXONEND"    "EXONID"    
[11] "EXONNAME"   "EXONRANK"   "EXONSTART"  "EXONSTRAND" "GENEID"    
[16] "TXCHROM"    "TXEND"      "TXID"       "TXNAME"     "TXSTART"   
[21] "TXSTRAND"   "TXTYPE"    

We are going to filter the database by a key or set of keys in order to extract the information we want. Valid names for the key can be retrieved with the keytypes function.

keytypes(txMm)
[1] "CDSID"    "CDSNAME"  "EXONID"   "EXONNAME" "GENEID"   "TXID"     "TXNAME"  

To extract information we use the select function. Let’s get transcript information for our most highly differentially expressed gene.

keyList <- ensemblAnnot$GeneID[ensemblAnnot$Symbol=="Wap"]
select(txMm, 
       keys=keyList,
       keytype = "GENEID",
       columns=c("TXNAME", "TXCHROM", "TXSTART", "TXEND", "TXSTRAND", "TXTYPE")
      )
'select()' returned 1:many mapping between keys and columns

Challenge 2

Use txMm to retrieve the exon coordinates for the genes:

  + ENSMUSG00000021604
  + ENSMUSG00000022146
  + ENSMUSG00000040118

Overview of GenomicRanges

One of the real strengths of the TxDb databases is the ability to interface with GenomicRanges, which is the object type used throughout Bioconductor to manipulate genomic intervals.

These object types permit us to perform common operations on intervals such as overlapping and counting. We can define the chromosome, start and end position of each region (and also the strand, although this is not shown here).

library(GenomicRanges)
simple_range <- GRanges(seqnames = "1", ranges = IRanges(start=1000, end=2000))
simple_range
GRanges object with 1 range and 0 metadata columns:
      seqnames    ranges strand
         <Rle> <IRanges>  <Rle>
  [1]        1 1000-2000      *
  -------
  seqinfo: 1 sequence from an unspecified genome; no seqlengths

We don’t have to have all our ranges located on the same chromosome.

chrs <- c("13", "15", "5")
start <- c(73000000, 6800000, 15000000)
end <- c(74000000, 6900000, 16000000)
my_ranges <- GRanges(seqnames = rep(chrs, 3),
                     ranges = IRanges(start = rep(start, each = 3),
                                      end = rep(end, each = 3))
                     )
my_ranges
GRanges object with 9 ranges and 0 metadata columns:
      seqnames            ranges strand
         <Rle>         <IRanges>  <Rle>
  [1]       13 73000000-74000000      *
  [2]       15 73000000-74000000      *
  [3]        5 73000000-74000000      *
  [4]       13   6800000-6900000      *
  [5]       15   6800000-6900000      *
  [6]        5   6800000-6900000      *
  [7]       13 15000000-16000000      *
  [8]       15 15000000-16000000      *
  [9]        5 15000000-16000000      *
  -------
  seqinfo: 3 sequences from an unspecified genome; no seqlengths

There are a number of useful functions for calculating properties of the data (such as coverage or sorting). These are less relevant for RNA-seq analysis, but GenomicRanges objects are used throughout Bioconductor for the analysis of NGS data.
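As a brief sketch of a few of these functions, the example below uses three made-up ranges (the coordinates are arbitrary) and shows width, sort and reduce from GenomicRanges.

```r
library(GenomicRanges)

# Three made-up ranges, deliberately not in genomic order
gr <- GRanges(seqnames = c("1", "1", "2"),
              ranges = IRanges(start = c(300, 100, 500),
                               end = c(450, 350, 600)))

# Length of each range in base pairs (end - start + 1)
width(gr)

# Order the ranges by chromosome and then by start position
sorted <- sort(gr)
sorted

# Merge overlapping ranges into single regions
merged <- reduce(gr)
merged
```

The two overlapping ranges on chromosome 1 are merged by reduce into a single region spanning 100 to 450, while the range on chromosome 2 is left as it is.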

For instance, we can quickly identify overlapping regions between two GenomicRanges.

keys <- c("ENSMUSG00000021604", "ENSMUSG00000022146", "ENSMUSG00000040118")
genePos <- select(txMm,
                  keys = keys,
                  keytype = "GENEID",
                  columns = c("EXONCHROM", "EXONSTART", "EXONEND")
                  )
'select()' returned 1:many mapping between keys and columns
geneRanges <- GRanges(genePos$EXONCHROM, 
                      ranges = IRanges(genePos$EXONSTART, genePos$EXONEND), 
                      GENEID = genePos$GENEID)
geneRanges
GRanges object with 96 ranges and 1 metadata column:
       seqnames            ranges strand |             GENEID
          <Rle>         <IRanges>  <Rle> |        <character>
   [1]       13 73260479-73260653      * | ENSMUSG00000021604
   [2]       13 73264848-73264979      * | ENSMUSG00000021604
   [3]       13 73265458-73265709      * | ENSMUSG00000021604
   [4]       13 73266596-73266708      * | ENSMUSG00000021604
   [5]       13 73267504-73267832      * | ENSMUSG00000021604
   ...      ...               ...    ... .                ...
  [92]        5 16327973-16329883      * | ENSMUSG00000040118
  [93]        5 16326151-16326383      * | ENSMUSG00000040118
  [94]        5 16340707-16341059      * | ENSMUSG00000040118
  [95]        5 16361395-16361875      * | ENSMUSG00000040118
  [96]        5 16362265-16362326      * | ENSMUSG00000040118
  -------
  seqinfo: 3 sequences from an unspecified genome; no seqlengths
findOverlaps(my_ranges, geneRanges)
Hits object with 40 hits and 0 metadata columns:
       queryHits subjectHits
       <integer>   <integer>
   [1]         1           1
   [2]         1           2
   [3]         1           3
   [4]         1           4
   [5]         1           5
   ...       ...         ...
  [36]         9          36
  [37]         9          75
  [38]         9          84
  [39]         9          85
  [40]         9          87
  -------
  queryLength: 9 / subjectLength: 96
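The Hits object can be hard to read at this scale, so here is a small sketch with made-up ranges where the answer can be checked by eye. Each row of the result pairs the index of a query range with the index of a subject range that it overlaps.

```r
library(GenomicRanges)

# Made-up query and subject ranges
query <- GRanges(seqnames = c("1", "1", "2"),
                 ranges = IRanges(start = c(100, 500, 100),
                                  end = c(200, 600, 200)))
subject <- GRanges(seqnames = c("1", "2"),
                   ranges = IRanges(start = c(150, 1000),
                                    end = c(550, 1100)))

hits <- findOverlaps(query, subject)
hits

# Indices of the overlapping pairs
queryHits(hits)
subjectHits(hits)

# Number of subject ranges overlapped by each query range
n_overlaps <- countOverlaps(query, subject)
n_overlaps
```

The first two query ranges both overlap the first subject range. The third query range is on the same chromosome as the second subject range but does not share any positions with it, so it has no hits.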

However, we have to pay attention to the naming convention used for each object. seqlevelsStyle can help.

seqlevelsStyle(simple_range)
[1] "NCBI"    "Ensembl" "MSU6"    "AGPvF"  
seqlevelsStyle(my_ranges)
[1] "NCBI"    "Ensembl" "JGI2.F" 
seqlevelsStyle(geneRanges)
[1] "NCBI"    "Ensembl" "JGI2.F" 
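The sketch below illustrates why this matters, using two made-up ranges that cover overlapping positions but follow different naming conventions ("1" versus "chr1"). Here we rename the sequence levels by hand with seqlevels and paste0; this is one simple option for illustration rather than the only way to harmonise names.

```r
library(GenomicRanges)

# Same chromosome, different naming conventions
ens_range <- GRanges(seqnames = "1",
                     ranges = IRanges(start = 1000, end = 2000))
ucsc_range <- GRanges(seqnames = "chr1",
                      ranges = IRanges(start = 1500, end = 2500))

# No overlap is found because "1" and "chr1" are treated as different
# sequences (a warning about sequence levels is suppressed here)
n_before <- length(suppressWarnings(findOverlaps(ens_range, ucsc_range)))
n_before

# Rename the sequence levels so that both objects use the "chr" prefix
seqlevels(ens_range) <- paste0("chr", seqlevels(ens_range))
n_after <- length(findOverlaps(ens_range, ucsc_range))
n_after
```

Once both objects use the same names the overlap is detected, so it is worth checking the naming convention whenever an overlap query unexpectedly returns nothing.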

Exporting tracks

It is also possible to export the results of a Bioconductor analysis to a genome browser to enable interactive analysis and integration with other data types, or sharing with collaborators. For instance, we might want a browser track to indicate where our differentially-expressed genes are located. We shall use the bed format to display these locations. We will annotate the ranges with information from our analysis such as the fold-change and significance.

First we create a data frame for just the DE genes.

sigGenes <- filter(resTab, FDR <= 0.01)
message("Number of significantly DE genes: ", nrow(sigGenes))
Number of significantly DE genes: 4279
head(sigGenes)

Create a genomic ranges object

Several convenience functions exist to retrieve the structure of every gene from a given TxDb object in one list. The output of exonsBy is a list, where each item in the list is the exon co-ordinates of a particular gene. However, we do not need this level of granularity for the bed output, so we will collapse to a single region for each gene.

First we use the range function to obtain a single range for every gene and transform to a more convenient object with unlist.

exoRanges <- exonsBy(txMm, "gene") %>% 
    range() %>% 
    unlist()
sigRegions <- exoRanges[na.omit(match(sigGenes$GeneID, names(exoRanges)))]
sigRegions
GRanges object with 4271 ranges and 0 metadata columns:
                     seqnames          ranges strand
                        <Rle>       <IRanges>  <Rle>
  ENSMUSG00000025903        1 4807788-4848410      +
  ENSMUSG00000103280        1 4905751-4906861      -
  ENSMUSG00000033793        1 5070018-5162529      +
  ENSMUSG00000051285        1 7088920-7173628      +
  ENSMUSG00000103509        1 7148110-7152137      +
                 ...      ...             ...    ...
  ENSMUSG00000064354       MT       7013-7696      +
  ENSMUSG00000064357       MT       7927-8607      +
  ENSMUSG00000064363       MT     10167-11544      +
  ENSMUSG00000064367       MT     11742-13565      +
  ENSMUSG00000064368       MT     13552-14070      -
  -------
  seqinfo: 139 sequences (1 circular) from an unspecified genome
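To show what range and unlist do to the exon list, here is a sketch on a hand-built GRangesList with two made-up genes (the gene names and coordinates are invented for illustration).

```r
library(GenomicRanges)

# Two made-up genes, each with two exons
exons <- GRangesList(
    geneA = GRanges("1", IRanges(start = c(100, 300), end = c(150, 400)),
                    strand = "+"),
    geneB = GRanges("2", IRanges(start = c(50, 500), end = c(80, 900)),
                    strand = "-")
)

# range() collapses each gene to the span from its first to its last exon;
# unlist() turns the list into a single GRanges named by gene
gene_span <- unlist(range(exons))
gene_span
```

Each gene is now represented by one region from the start of its first exon to the end of its last exon, and the gene names are carried over as the names of the ranges, which is what allows us to match against sigGenes$GeneID.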

For visualisation purposes, we are going to restrict the data to genes that are located on chromosomes 1 to 19 and the sex chromosomes. This can be done with the keepSeqlevels function.

seqlevels(sigRegions)
  [1] "CHR_CAST_EI_MMCHR11_CTG4"  "CHR_CAST_EI_MMCHR11_CTG5" 
  [3] "CHR_MG104_PATCH"           "CHR_MG117_PATCH"          
  [5] "CHR_MG132_PATCH"           "CHR_MG153_PATCH"          
  [7] "CHR_MG171_PATCH"           "CHR_MG184_PATCH"          
  [9] "CHR_MG190_MG3751_PATCH"    "CHR_MG191_PATCH"          
 [11] "CHR_MG209_PATCH"           "CHR_MG3172_PATCH"         
 [13] "CHR_MG3231_PATCH"          "CHR_MG3251_PATCH"         
 [15] "CHR_MG3490_PATCH"          "CHR_MG3496_PATCH"         
 [17] "CHR_MG3530_PATCH"          "CHR_MG3561_PATCH"         
 [19] "CHR_MG3562_PATCH"          "CHR_MG3609_PATCH"         
 [21] "CHR_MG3618_PATCH"          "CHR_MG3627_PATCH"         
 [23] "CHR_MG3648_PATCH"          "CHR_MG3656_PATCH"         
 [25] "CHR_MG3683_PATCH"          "CHR_MG3686_PATCH"         
 [27] "CHR_MG3699_PATCH"          "CHR_MG3700_PATCH"         
 [29] "CHR_MG3712_PATCH"          "CHR_MG3714_PATCH"         
 [31] "CHR_MG3829_PATCH"          "CHR_MG3833_MG4220_PATCH"  
 [33] "CHR_MG3835_PATCH"          "CHR_MG3836_PATCH"         
 [35] "CHR_MG3999_PATCH"          "CHR_MG4136_PATCH"         
 [37] "CHR_MG4138_PATCH"          "CHR_MG4151_PATCH"         
 [39] "CHR_MG4162_PATCH"          "CHR_MG4180_PATCH"         
 [41] "CHR_MG4198_PATCH"          "CHR_MG4200_PATCH"         
 [43] "CHR_MG4209_PATCH"          "CHR_MG4211_PATCH"         
 [45] "CHR_MG4212_PATCH"          "CHR_MG4213_PATCH"         
 [47] "CHR_MG4214_PATCH"          "CHR_MG4222_MG3908_PATCH"  
 [49] "CHR_MG4243_PATCH"          "CHR_MG4248_PATCH"         
 [51] "CHR_MG4249_PATCH"          "CHR_MG4254_PATCH"         
 [53] "CHR_MG4255_PATCH"          "CHR_MG4259_PATCH"         
 [55] "CHR_MG4261_PATCH"          "CHR_MG4264_PATCH"         
 [57] "CHR_MG4265_PATCH"          "CHR_MG4266_PATCH"         
 [59] "CHR_MG4281_PATCH"          "CHR_MG4288_PATCH"         
 [61] "CHR_MG4308_PATCH"          "CHR_MG4310_MG4311_PATCH"  
 [63] "CHR_MG51_PATCH"            "CHR_MG65_PATCH"           
 [65] "CHR_MG74_PATCH"            "CHR_MG89_PATCH"           
 [67] "CHR_MMCHR1_CHORI29_IDD5_1" "CHR_PWK_PHJ_MMCHR11_CTG1" 
 [69] "CHR_PWK_PHJ_MMCHR11_CTG2"  "CHR_PWK_PHJ_MMCHR11_CTG3" 
 [71] "CHR_WSB_EIJ_MMCHR11_CTG1"  "CHR_WSB_EIJ_MMCHR11_CTG2" 
 [73] "CHR_WSB_EIJ_MMCHR11_CTG3"  "1"                        
 [75] "2"                         "3"                        
 [77] "4"                         "5"                        
 [79] "6"                         "7"                        
 [81] "8"                         "9"                        
 [83] "10"                        "11"                       
 [85] "12"                        "13"                       
 [87] "14"                        "15"                       
 [89] "16"                        "17"                       
 [91] "18"                        "19"                       
 [93] "X"                         "Y"                        
 [95] "MT"                        "GL456210.1"               
 [97] "GL456211.1"                "GL456212.1"               
 [99] "GL456213.1"                "GL456216.1"               
[101] "GL456219.1"                "GL456221.1"               
[103] "GL456233.1"                "GL456239.1"               
[105] "GL456350.1"                "GL456354.1"               
[107] "GL456359.1"                "GL456360.1"               
[109] "GL456366.1"                "GL456367.1"               
[111] "GL456368.1"                "GL456370.1"               
[113] "GL456372.1"                "GL456378.1"               
[115] "GL456379.1"                "GL456381.1"               
[117] "GL456382.1"                "GL456383.1"               
[119] "GL456385.1"                "GL456387.1"               
[121] "GL456389.1"                "GL456390.1"               
[123] "GL456392.1"                "GL456393.1"               
[125] "GL456394.1"                "GL456396.1"               
[127] "JH584292.1"                "JH584293.1"               
[129] "JH584294.1"                "JH584295.1"               
[131] "JH584296.1"                "JH584297.1"               
[133] "JH584298.1"                "JH584299.1"               
[135] "JH584300.1"                "JH584301.1"               
[137] "JH584302.1"                "JH584303.1"               
[139] "JH584304.1"               
sigRegions <- keepSeqlevels(sigRegions, 
                            value = c(1:19,"X","Y"),
                            pruning.mode="tidy")
seqlevels(sigRegions)
 [1] "1"  "2"  "3"  "4"  "5"  "6"  "7"  "8"  "9"  "10" "11" "12" "13" "14" "15"
[16] "16" "17" "18" "19" "X"  "Y" 

Add metadata to GRanges object

A useful property of GenomicRanges is that we can attach metadata to each range using the mcols function. The metadata can be supplied in the form of a data frame.

mcols(sigRegions) <- sigGenes[match(names(sigRegions), sigGenes$GeneID), ]
sigRegions
GRanges object with 4263 ranges and 16 metadata columns:
                     seqnames            ranges strand |             GeneID
                        <Rle>         <IRanges>  <Rle> |        <character>
  ENSMUSG00000025903        1   4807788-4848410      + | ENSMUSG00000025903
  ENSMUSG00000103280        1   4905751-4906861      - | ENSMUSG00000103280
  ENSMUSG00000033793        1   5070018-5162529      + | ENSMUSG00000033793
  ENSMUSG00000051285        1   7088920-7173628      + | ENSMUSG00000051285
  ENSMUSG00000103509        1   7148110-7152137      + | ENSMUSG00000103509
                 ...      ...               ...    ... .                ...
  ENSMUSG00000033478       19 57361009-57389594      + | ENSMUSG00000033478
  ENSMUSG00000040022       19 59902884-59943654      - | ENSMUSG00000040022
  ENSMUSG00000024993       19 60811585-60836227      + | ENSMUSG00000024993
  ENSMUSG00000024997       19 60864051-60874556      - | ENSMUSG00000024997
  ENSMUSG00000074733       19 61053840-61140840      - | ENSMUSG00000074733
                             baseMean             logFC             lfcSE
                            <numeric>         <numeric>         <numeric>
  ENSMUSG00000025903 724.446609497753  0.64787137782662 0.144229304891253
  ENSMUSG00000103280 11.0727087099247 -1.58750612880118 0.434503203076042
  ENSMUSG00000033793 1263.66334600512 0.877213503228488 0.106454855140229
  ENSMUSG00000051285  1483.9749407736  1.29960059994037 0.176033709290278
  ENSMUSG00000103509 25.8677212181917  1.18134725817846 0.299145205180617
                 ...              ...               ...               ...
  ENSMUSG00000033478 604.940764140757 0.485457965666896 0.143244108885519
  ENSMUSG00000040022 420.277348654596  1.04903357170258 0.177594139807896
  ENSMUSG00000024993 273.706275359975 0.570208882336257 0.163411073403972
  ENSMUSG00000024997 1155.47226515427 0.896987748935108 0.184738597369257
  ENSMUSG00000074733 151.521609493818 0.831352686057778 0.159578210641601
                                  stat               pvalue
                             <numeric>            <numeric>
  ENSMUSG00000025903  4.50342169142557 6.68680141243707e-06
  ENSMUSG00000103280 -3.55360486193175  0.00037998967372985
  ENSMUSG00000033793  8.25203716658962 1.55716974876409e-16
  ENSMUSG00000051285  7.35737302703237 1.87564671714221e-13
  ENSMUSG00000103509  3.84551924990226 0.000120297419390057
                 ...               ...                  ...
  ENSMUSG00000033478  3.38170660330777 0.000720370394442405
  ENSMUSG00000040022  5.87606192866585 4.20141204139746e-09
  ENSMUSG00000024993  3.45286302118763 0.000554670584402014
  ENSMUSG00000024997  4.88409037207611  1.0390741291339e-06
  ENSMUSG00000074733  5.14752752400069 2.63942285548567e-07
                                      FDR    Entrez      Symbol
                                <numeric> <integer> <character>
  ENSMUSG00000025903 6.82491399525833e-05     18777      Lypla1
  ENSMUSG00000103280  0.00227338281229412      <NA>     Gm37277
  ENSMUSG00000033793 1.34050472716004e-14    108664     Atp6v1h
  ENSMUSG00000051285 9.64437264692718e-12    319263      Pcmtd1
  ENSMUSG00000103509 0.000849854587410265      <NA>     Gm38372
                 ...                  ...       ...         ...
  ENSMUSG00000033478  0.00387098901394704    226252    Fam160b1
  ENSMUSG00000040022 9.31606807547629e-08     74998   Rab11fip2
  ENSMUSG00000024993  0.00311011136700511     67894      Fam45a
  ENSMUSG00000024997 1.31438732092901e-05     11757       Prdx3
  ENSMUSG00000074733 3.88962198494305e-06    414758      Zfp950
                                                                                                                                               Description
                                                                                                                                               <character>
  ENSMUSG00000025903                                                                 Acyl-protein thioesterase 1  [Source:UniProtKB/Swiss-Prot;Acc:P97823]
  ENSMUSG00000103280                                                                             predicted gene, 37277 [Source:MGI Symbol;Acc:MGI:5610505]
  ENSMUSG00000033793                                                              V-type proton ATPase subunit H  [Source:UniProtKB/Swiss-Prot;Acc:Q8BVE3]
  ENSMUSG00000051285                      protein-L-isoaspartate (D-aspartate) O-methyltransferase domain containing 1 [Source:MGI Symbol;Acc:MGI:2441773]
  ENSMUSG00000103509                                                                             predicted gene, 38372 [Source:MGI Symbol;Acc:MGI:5611600]
                 ...                                                                                                                                   ...
  ENSMUSG00000033478                                                                            Protein FAM160B1  [Source:UniProtKB/Swiss-Prot;Acc:Q8CDM8]
  ENSMUSG00000040022                                                      RAB11 family interacting protein 2 (class I) [Source:MGI Symbol;Acc:MGI:1922248]
  ENSMUSG00000024993 Mus musculus family with sequence similarity 45, member A (Fam45a), transcript variant 3, mRNA. [Source:RefSeq mRNA;Acc:NM_001347464]
  ENSMUSG00000024997                                     Thioredoxin-dependent peroxide reductase, mitochondrial  [Source:UniProtKB/Swiss-Prot;Acc:P20108]
  ENSMUSG00000074733                                                                           zinc finger protein 950 [Source:MGI Symbol;Acc:MGI:2652824]
                            Biotype         Chr     Start       End    Strand
                        <character> <character> <integer> <integer> <integer>
  ENSMUSG00000025903 protein_coding           1   4807788   4848410         1
  ENSMUSG00000103280            TEC           1   4905751   4906861        -1
  ENSMUSG00000033793 protein_coding           1   5070018   5162529         1
  ENSMUSG00000051285 protein_coding           1   7088920   7173628         1
  ENSMUSG00000103509            TEC           1   7148110   7152137         1
                 ...            ...         ...       ...       ...       ...
  ENSMUSG00000033478 protein_coding          19  57361009  57389594         1
  ENSMUSG00000040022 protein_coding          19  59902884  59943654        -1
  ENSMUSG00000024993 protein_coding          19  60811585  60836227         1
  ENSMUSG00000024997 protein_coding          19  60864051  60874556        -1
  ENSMUSG00000074733 protein_coding          19  61053840  61140840        -1
                     TopGeneLabel
                      <character>
  ENSMUSG00000025903             
  ENSMUSG00000103280             
  ENSMUSG00000033793             
  ENSMUSG00000051285             
  ENSMUSG00000103509             
                 ...          ...
  ENSMUSG00000033478             
  ENSMUSG00000040022             
  ENSMUSG00000024993             
  ENSMUSG00000024997             
  ENSMUSG00000074733             
  -------
  seqinfo: 21 sequences from an unspecified genome

Scores and colour on exported tracks

The .bed file format is commonly used to store genomic locations for display in genome browsers (e.g. the UCSC browser or IGV) as tracks. Rather than just representing the genomic locations, the .bed format is also able to colour each range according to some property of the analysis (e.g. direction and magnitude of change) to help highlight particular regions of interest. A score can also be displayed when a particular region is clicked on.

For the score we can use \(-\log_{10}\) of the adjusted p-value, and we can base the colour scheme for the regions on the fold-change.

colorRampPalette is a useful function in base R for constructing a palette between two extremes. When choosing colour palettes, make sure they are colour-blind friendly. The red / green colour scheme traditionally applied to microarrays is a bad choice.

We will also truncate the fold-changes to between -5 and 5 and divide the resulting values into 10 equal-width bins.

rbPal <- colorRampPalette(c("red", "blue"))
logFC <- pmax(sigRegions$logFC, -5)
logFC <- pmin(logFC , 5)
Cols <- rbPal(10)[as.numeric(cut(logFC, breaks = 10))]
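
Note that cut(x, breaks = 10) splits the observed range of x into ten intervals, so the bins only span -5 to 5 exactly when the truncated values reach both limits. A minimal sketch of a variant with explicit break points is shown below. The toy vector lfc and the names binEdges, bin and toyCols are made up for illustration and are not part of the course data.

```r
# Toy log fold-changes (illustrative values only)
lfc <- c(-7.2, -3.1, -0.4, 0.0, 0.9, 2.6, 6.3)

# Truncate to the interval [-5, 5]
lfc <- pmin(pmax(lfc, -5), 5)

# Eleven edges define ten equal-width bins over the fixed range
binEdges <- seq(-5, 5, length.out = 11)
bin <- cut(lfc, breaks = binEdges, include.lowest = TRUE, labels = FALSE)

# Look up one colour per bin
rbPal <- colorRampPalette(c("red", "blue"))
toyCols <- rbPal(10)[bin]
data.frame(lfc, bin, toyCols)
```

With fixed edges, a given fold-change always maps to the same colour, whichever set of genes happens to be plotted.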

The score and colours have to be saved in the GRanges object as score and itemRgb columns respectively, and will be used to construct the browser track. The rtracklayer package can be used to import and export browser tracks.

Now we can export the significant results from the DE analysis as a .bed track using rtracklayer. You can load the resulting file in IGV, if you wish.

mcols(sigRegions)$score <- -log10(sigRegions$FDR)
mcols(sigRegions)$itemRgb <- Cols
sigRegions
GRanges object with 4263 ranges and 18 metadata columns:
                     seqnames            ranges strand |             GeneID
                        <Rle>         <IRanges>  <Rle> |        <character>
  ENSMUSG00000025903        1   4807788-4848410      + | ENSMUSG00000025903
  ENSMUSG00000103280        1   4905751-4906861      - | ENSMUSG00000103280
  ENSMUSG00000033793        1   5070018-5162529      + | ENSMUSG00000033793
  ENSMUSG00000051285        1   7088920-7173628      + | ENSMUSG00000051285
  ENSMUSG00000103509        1   7148110-7152137      + | ENSMUSG00000103509
                 ...      ...               ...    ... .                ...
  ENSMUSG00000033478       19 57361009-57389594      + | ENSMUSG00000033478
  ENSMUSG00000040022       19 59902884-59943654      - | ENSMUSG00000040022
  ENSMUSG00000024993       19 60811585-60836227      + | ENSMUSG00000024993
  ENSMUSG00000024997       19 60864051-60874556      - | ENSMUSG00000024997
  ENSMUSG00000074733       19 61053840-61140840      - | ENSMUSG00000074733
                             baseMean             logFC             lfcSE
                            <numeric>         <numeric>         <numeric>
  ENSMUSG00000025903 724.446609497753  0.64787137782662 0.144229304891253
  ENSMUSG00000103280 11.0727087099247 -1.58750612880118 0.434503203076042
  ENSMUSG00000033793 1263.66334600512 0.877213503228488 0.106454855140229
  ENSMUSG00000051285  1483.9749407736  1.29960059994037 0.176033709290278
  ENSMUSG00000103509 25.8677212181917  1.18134725817846 0.299145205180617
                 ...              ...               ...               ...
  ENSMUSG00000033478 604.940764140757 0.485457965666896 0.143244108885519
  ENSMUSG00000040022 420.277348654596  1.04903357170258 0.177594139807896
  ENSMUSG00000024993 273.706275359975 0.570208882336257 0.163411073403972
  ENSMUSG00000024997 1155.47226515427 0.896987748935108 0.184738597369257
  ENSMUSG00000074733 151.521609493818 0.831352686057778 0.159578210641601
                                  stat               pvalue
                             <numeric>            <numeric>
  ENSMUSG00000025903  4.50342169142557 6.68680141243707e-06
  ENSMUSG00000103280 -3.55360486193175  0.00037998967372985
  ENSMUSG00000033793  8.25203716658962 1.55716974876409e-16
  ENSMUSG00000051285  7.35737302703237 1.87564671714221e-13
  ENSMUSG00000103509  3.84551924990226 0.000120297419390057
                 ...               ...                  ...
  ENSMUSG00000033478  3.38170660330777 0.000720370394442405
  ENSMUSG00000040022  5.87606192866585 4.20141204139746e-09
  ENSMUSG00000024993  3.45286302118763 0.000554670584402014
  ENSMUSG00000024997  4.88409037207611  1.0390741291339e-06
  ENSMUSG00000074733  5.14752752400069 2.63942285548567e-07
                                      FDR    Entrez      Symbol
                                <numeric> <integer> <character>
  ENSMUSG00000025903 6.82491399525833e-05     18777      Lypla1
  ENSMUSG00000103280  0.00227338281229412      <NA>     Gm37277
  ENSMUSG00000033793 1.34050472716004e-14    108664     Atp6v1h
  ENSMUSG00000051285 9.64437264692718e-12    319263      Pcmtd1
  ENSMUSG00000103509 0.000849854587410265      <NA>     Gm38372
                 ...                  ...       ...         ...
  ENSMUSG00000033478  0.00387098901394704    226252    Fam160b1
  ENSMUSG00000040022 9.31606807547629e-08     74998   Rab11fip2
  ENSMUSG00000024993  0.00311011136700511     67894      Fam45a
  ENSMUSG00000024997 1.31438732092901e-05     11757       Prdx3
  ENSMUSG00000074733 3.88962198494305e-06    414758      Zfp950
                                                                                                                                               Description
                                                                                                                                               <character>
  ENSMUSG00000025903                                                                 Acyl-protein thioesterase 1  [Source:UniProtKB/Swiss-Prot;Acc:P97823]
  ENSMUSG00000103280                                                                             predicted gene, 37277 [Source:MGI Symbol;Acc:MGI:5610505]
  ENSMUSG00000033793                                                              V-type proton ATPase subunit H  [Source:UniProtKB/Swiss-Prot;Acc:Q8BVE3]
  ENSMUSG00000051285                      protein-L-isoaspartate (D-aspartate) O-methyltransferase domain containing 1 [Source:MGI Symbol;Acc:MGI:2441773]
  ENSMUSG00000103509                                                                             predicted gene, 38372 [Source:MGI Symbol;Acc:MGI:5611600]
                 ...                                                                                                                                   ...
  ENSMUSG00000033478                                                                            Protein FAM160B1  [Source:UniProtKB/Swiss-Prot;Acc:Q8CDM8]
  ENSMUSG00000040022                                                      RAB11 family interacting protein 2 (class I) [Source:MGI Symbol;Acc:MGI:1922248]
  ENSMUSG00000024993 Mus musculus family with sequence similarity 45, member A (Fam45a), transcript variant 3, mRNA. [Source:RefSeq mRNA;Acc:NM_001347464]
  ENSMUSG00000024997                                     Thioredoxin-dependent peroxide reductase, mitochondrial  [Source:UniProtKB/Swiss-Prot;Acc:P20108]
  ENSMUSG00000074733                                                                           zinc finger protein 950 [Source:MGI Symbol;Acc:MGI:2652824]
                            Biotype         Chr     Start       End    Strand
                        <character> <character> <integer> <integer> <integer>
  ENSMUSG00000025903 protein_coding           1   4807788   4848410         1
  ENSMUSG00000103280            TEC           1   4905751   4906861        -1
  ENSMUSG00000033793 protein_coding           1   5070018   5162529         1
  ENSMUSG00000051285 protein_coding           1   7088920   7173628         1
  ENSMUSG00000103509            TEC           1   7148110   7152137         1
                 ...            ...         ...       ...       ...       ...
  ENSMUSG00000033478 protein_coding          19  57361009  57389594         1
  ENSMUSG00000040022 protein_coding          19  59902884  59943654        -1
  ENSMUSG00000024993 protein_coding          19  60811585  60836227         1
  ENSMUSG00000024997 protein_coding          19  60864051  60874556        -1
  ENSMUSG00000074733 protein_coding          19  61053840  61140840        -1
                     TopGeneLabel            score     itemRgb
                      <character>        <numeric> <character>
  ENSMUSG00000025903              4.16590281705205     #71008D
  ENSMUSG00000103280              2.64332742777324     #AA0055
  ENSMUSG00000033793               13.872731650181     #71008D
  ENSMUSG00000051285              11.0157260173554     #5500AA
  ENSMUSG00000103509              3.07065537697718     #5500AA
                 ...          ...              ...         ...
  ENSMUSG00000033478              2.41217806122792     #71008D
  ENSMUSG00000040022              7.03076734659767     #5500AA
  ENSMUSG00000024993              2.50722405945875     #71008D
  ENSMUSG00000024997              4.88127663891956     #71008D
  ENSMUSG00000074733              5.41009260377211     #71008D
  -------
  seqinfo: 21 sequences from an unspecified genome
library(rtracklayer)
export(sigRegions , con = "results/topHits.bed")
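
For orientation, the sketch below assembles a single BED record by hand in base R, using the first range shown above. It assumes the standard nine-column BED layout, in which the start coordinate is 0-based and the colour is written as a comma-separated R,G,B triple. The helper name toBedLine is hypothetical, and the file written by rtracklayer may differ in details such as how the score is formatted.

```r
# Hypothetical helper: format one range as a nine-column BED record
toBedLine <- function(chr, start, end, name, score, strand, hexCol) {
    rgbTriple <- paste(col2rgb(hexCol)[, 1], collapse = ",")  # hex colour -> "R,G,B"
    bedStart <- start - 1L                                    # 1-based start -> 0-based
    paste(chr, bedStart, end, name, score, strand,
          bedStart, end,                                      # thickStart, thickEnd
          rgbTriple, sep = "\t")
}

bedLine <- toBedLine("1", 4807788L, 4848410L, "ENSMUSG00000025903",
                     round(4.16590281705205, 2), "+", "#71008D")
cat(bedLine, "\n")
```

The conversion between the 1-based, closed coordinates of a GRanges object and the 0-based start of a BED file is the detail that most often goes wrong when such files are written by hand, which is a good reason to leave the export to rtracklayer.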

Extracting Reads

As we have been using counts as our starting point, we haven’t investigated the aligned reads from our experiment or how they are represented. As you may be aware, aligned reads are usually stored in a bam file that can be manipulated with open-source command-line tools such as samtools and picard. Bioconductor provides a low-level interface to bam/sam files in the form of the Rsamtools package. The GenomicAlignments package can also be used to retrieve the reads mapping to a particular genomic region in an efficient manner.

library(GenomicAlignments)

In the directory small_bams there should be .bam files for some of the samples in the example study. The workflow to produce these files is described in a supplementary page for the course. In brief, the raw reads (fastq) were downloaded from the Sequence Read Archive (SRA) and aligned with hisat2. Each bam file was originally named according to the file name in SRA, but we have renamed the files according to their name in the study. An index file (.bai) has been generated for each bam file. In order to reduce the size, the bam files used here only contain a subset of the reads that were aligned in the region chr15:101707000-101713000.

list.files("counts/small_bams/")
 [1] "MCL1.DG.15.sm.bam"                "MCL1.DG.15.sm.bam.bai"           
 [3] "MCL1.DH.15.sm.bam"                "MCL1.DH.15.sm.bam.bai"           
 [5] "MCL1.DI.15.sm.bam"                "MCL1.DI.15.sm.bam.bai"           
 [7] "MCL1.DJ.15.sm.bam"                "MCL1.DJ.15.sm.bam.bai"           
 [9] "MCL1.DK.15.sm.bam"                "MCL1.DK.15.sm.bam.bai"           
[11] "MCL1.DL.15.sm.bam"                "MCL1.DL.15.sm.bam.bai"           
[13] "MCL1.LA.15.sm.bam"                "MCL1.LA.15.sm.bam.bai"           
[15] "MCL1.LB.15.sm.bam"                "MCL1.LB.15.sm.bam.bai"           
[17] "MCL1.LC.15.sm.bam"                "MCL1.LC.15.sm.bam.bai"           
[19] "MCL1.LD.15.sm.bam"                "MCL1.LD.15.sm.bam.bai"           
[21] "MCL1.LE.15.sm.bam"                "MCL1.LE.15.sm.bam.bai"           
[23] "MCL1.LF.15.sm.bam"                "MCL1.LF.15.sm.bam.bai"           
[25] "Mus_musculus.GRCm38.80.chr15.gtf"

The readGAlignments function provides a simple interface to interrogate the aligned reads for a particular sample. It can also utilise the index file in order to retrieve only the reads that correspond to a specific region in an efficient manner. The output includes the genomic location of each aligned read and the CIGAR (Compact Idiosyncratic Gapped Alignment Report) string, where M denotes an alignment match to the genome (which may be a sequence match or a mismatch), and I and D correspond to insertions and deletions. In the output below, N marks a skipped region of the reference (such as an intron) and S marks soft-clipped bases.

exo <- exonsBy(txMm, "gene") 
generegion <- exo[["ENSMUSG00000022146"]] %>% 
    keepSeqlevels(value = 15, pruning.mode="tidy")
my.reads <- readGAlignments(file="counts/small_bams/MCL1.DG.15.sm.bam",
                       param=ScanBamParam(which=generegion))
my.reads
GAlignments object with 25419 alignments and 0 metadata columns:
          seqnames strand          cigar    qwidth     start       end
             <Rle>  <Rle>    <character> <integer> <integer> <integer>
      [1]       15      + 81M53311N11M8S       100   6799340   6852742
      [2]       15      +           100M       100   6813575   6813674
      [3]       15      +          3S97M       100   6813579   6813675
      [4]       15      +          6S94M       100   6813579   6813672
      [5]       15      +           100M       100   6813580   6813679
      ...      ...    ...            ...       ...       ...       ...
  [25415]       15      -           100M       100   6874937   6875036
  [25416]       15      -           100M       100   6874941   6875040
  [25417]       15      -          99M1S       100   6874945   6875043
  [25418]       15      +           100M       100   6874962   6875061
  [25419]       15      -           100M       100   6874966   6875065
              width     njunc
          <integer> <integer>
      [1]     53403         1
      [2]       100         0
      [3]        97         0
      [4]        94         0
      [5]       100         0
      ...       ...       ...
  [25415]       100         0
  [25416]       100         0
  [25417]        99         0
  [25418]       100         0
  [25419]       100         0
  -------
  seqinfo: 66 sequences from an unspecified genome
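
To see how the CIGAR string relates to the qwidth and width columns above, here is a minimal base R sketch that splits a CIGAR string into its operations. It assumes the operation codes of the SAM format (M, I, D, N, S, H, P, = and X). parseCigar is a hypothetical helper written for this illustration; GenomicAlignments has its own CIGAR utilities, which would be the better choice in real analyses.

```r
# Hypothetical helper: split a CIGAR string into lengths and operation codes
parseCigar <- function(cigar) {
    tokens <- regmatches(cigar, gregexpr("[0-9]+[MIDNSHP=X]", cigar))[[1]]
    data.frame(len = as.integer(sub("[MIDNSHP=X]$", "", tokens)),
               op  = sub("^[0-9]+", "", tokens),
               stringsAsFactors = FALSE)
}

ops <- parseCigar("81M53311N11M8S")   # first read in the output above

# Operations that consume bases of the read (query)
readWidth <- sum(ops$len[ops$op %in% c("M", "I", "S", "=", "X")])
# Operations that consume positions on the reference
refWidth <- sum(ops$len[ops$op %in% c("M", "D", "N", "=", "X")])

c(qwidth = readWidth, width = refWidth)
```

For this read the result is 100 read bases spanning 53403 reference positions, matching the qwidth and width columns. The large N operation is what we would expect for a read spliced across an intron.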

It is possible to tweak the function to retrieve other potentially-useful information from the bam file, such as the mapping quality and flag.

my.reads <- readGAlignments(file="counts/small_bams/MCL1.DG.15.sm.bam",
                       param=ScanBamParam(which=generegion,
                                          what=c("seq","mapq","flag")))
my.reads
GAlignments object with 25419 alignments and 3 metadata columns:
          seqnames strand          cigar    qwidth     start       end
             <Rle>  <Rle>    <character> <integer> <integer> <integer>
      [1]       15      + 81M53311N11M8S       100   6799340   6852742
      [2]       15      +           100M       100   6813575   6813674
      [3]       15      +          3S97M       100   6813579   6813675
      [4]       15      +          6S94M       100   6813579   6813672
      [5]       15      +           100M       100   6813580   6813679
      ...      ...    ...            ...       ...       ...       ...
  [25415]       15      -           100M       100   6874937   6875036
  [25416]       15      -           100M       100   6874941   6875040
  [25417]       15      -          99M1S       100   6874945   6875043
  [25418]       15      +           100M       100   6874962   6875061
  [25419]       15      -           100M       100   6874966   6875065
              width     njunc |                     seq      mapq      flag
          <integer> <integer> |          <DNAStringSet> <integer> <integer>
      [1]     53403         1 | GTTTGGAAGT...TCTCCTAAAC        60         0
      [2]       100         0 | GAAATGTTTT...ATCAATGTCA        60         0
      [3]        97         0 | TTTTGTTTTA...TCAATGTCAT        60         0
      [4]        94         0 | TTTTTTTGTT...AAATCAATGT        60         0
      [5]       100         0 | GTTTTAATTT...TGTCATTAAC        60         0
      ...       ...       ... .                     ...       ...       ...
  [25415]       100         0 | TCTCTTTATG...TTCCCACCAG        60        16
  [25416]       100         0 | TTTATGGCTG...CACCAGTCGC        60        16
  [25417]        99         0 | TGGCTGCATG...AGTCGCCAGA        60        16
  [25418]       100         0 | GTCCACAGCC...GCCTGGAGAA        60         0
  [25419]       100         0 | ACAGCCACGT...GGAGAACCGC        60        16
  -------
  seqinfo: 66 sequences from an unspecified genome

The flag can represent useful QC information, e.g.

  • Read is unmapped
  • Read is paired / unpaired
  • Read failed QC
  • Read is a PCR duplicate (see later)

The combination of any of these properties is used to derive a numeric value, as illustrated in this useful resource
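
As a rough illustration of how the numeric value is built up, the sketch below tests individual bits of a flag with bitwAnd from base R. The bit values are those defined in the SAM format specification. decodeFlag and flagBits are hypothetical names used only here, not part of Rsamtools or GenomicAlignments.

```r
# Bit values from the SAM specification for a few commonly checked properties
flagBits <- c(paired = 1L, unmapped = 4L, reverseStrand = 16L,
              failedQC = 512L, duplicate = 1024L)

# Hypothetical helper: report which of the properties above are set in a flag
decodeFlag <- function(flag) {
    names(flagBits)[bitwAnd(as.integer(flag), flagBits) > 0]
}

decodeFlag(16)     # read aligned to the reverse strand
decodeFlag(0)      # none of these bits are set
decodeFlag(1040)   # 16 + 1024: reverse strand and PCR duplicate
```

This explains the two values seen in the output above: reads on the + strand have flag 0 and reads on the - strand have flag 16.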

Particular attributes of the reads can be extracted and visualised

hist(mcols(my.reads)$mapq, main="", xlab="MAPQ")

However, there are more-sophisticated visualisation options for aligned reads and range data. We will use the ggbio package, which first requires some discussion of the ggplot2 plotting package.

Composing plots with ggbio

We will now take a brief look at one of the visualisation packages in Bioconductor that takes advantage of the GenomicRanges and GenomicFeatures object-types. In this section we will show a worked example of how to combine several types of genomic data on the same plot. The documentation for ggbio is very extensive and contains lots of examples.

http://www.tengfei.name/ggbio/docs/

The Gviz package is another Bioconductor package that specialises in genomic visualisations, but we will not explore this package in the course.

The Manhattan plot is a common way of visualising genome-wide results, especially when one is concerned with the results of a genome-wide association study (GWAS) and identifying strongly-associated hits.

The profile is supposed to resemble the Manhattan skyline, with particular skyscrapers towering above the lower-level buildings.

This type of plot is implemented as the plotGrandLinear function. We have to supply a value to display on the y-axis using the aes function, which is inherited from ggplot2. The positioning of points on the x-axis is handled automatically by ggbio, using the ranges information to get the genomic coordinates of the ranges of interest.

To stop the plots from being too cluttered we will consider the top 200 genes only.

library(ggbio)
Need specific help about ggbio? try mailing 
 the maintainer or visit http://tengfei.github.com/ggbio/

Attaching package: 'ggbio'

The following objects are masked from 'package:ggplot2':

    geom_bar, geom_rect, geom_segment, ggsave, stat_bin, stat_identity,
    xlim
top200 <- sigRegions[order(sigRegions$FDR)[1:200]]
plotGrandLinear(top200 , aes(y = logFC))
using coord:genome to parse x scale
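
The indexing pattern order(x)[1:n] assumes that there are at least n elements; with fewer, the missing positions become NA. A small sketch on made-up FDR values shows the pattern and a more defensive variant using head(). The vector toyFDR and its gene names are illustrative only.

```r
# Made-up FDR values for five genes
toyFDR <- c(geneA = 0.04, geneB = 0.001, geneC = 0.2,
            geneD = 0.0005, geneE = 0.03)

# Indices of the three smallest FDR values, most significant first
top3 <- order(toyFDR)[1:3]
names(toyFDR)[top3]

# Asking for more elements than exist pads the index with NA ...
padded <- order(toyFDR)[1:7]
padded

# ... whereas head() simply returns what is available
safe <- head(order(toyFDR), 7)
safe
```

With 4263 significant ranges the original code is safe, but the head() form avoids surprises when the same code is reused on a shorter results table.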

ggbio has alternated the colours of the chromosomes. However, an appealing feature of ggplot2 is the ability to map properties of your plot to variables present in your data. For example, we could create a variable to distinguish between up- and down-regulated genes. The variables used for aesthetic mapping must be present in the mcols section of your ranges object.

mcols(top200)$UpRegulated <- mcols(top200)$logFC > 0
plotGrandLinear(top200, aes(y = logFC, col = UpRegulated))
using coord:genome to parse x scale

plotGrandLinear is a special function in ggbio with preset options for the Manhattan style of plot. More often, users will call the autoplot function and ggbio will choose the most appropriate layout. One such layout is the karyogram.

autoplot(top200, layout="karyogram", aes(color=UpRegulated,
                                       fill=UpRegulated))
Scale for 'x' is already present. Adding another scale for 'x', which will
replace the existing scale.
Scale for 'x' is already present. Adding another scale for 'x', which will
replace the existing scale.

ggbio is also able to plot the structure of genes according to a particular model represented by a GenomicFeatures object, such as the object we created earlier with the exon coordinates for each gene in the GRCm38 genome.

autoplot(txMm, which=exo[["ENSMUSG00000022146"]])
Parsing transcripts...
Parsing exons...
Parsing cds...
Parsing utrs...
------exons...
------cdss...
------introns...
------utr...
aggregating...
Done
Constructing graphics...

We can even plot the location of sequencing reads if they have been imported using the readGAlignments function (or similar).

myreg <- exo[["ENSMUSG00000022146"]] %>% 
    GenomicRanges::reduce() %>% 
    flank(width = 1000, both = TRUE) %>% 
    keepSeqlevels(value = 15, pruning.mode="tidy")
bam <- readGappedReads(file="counts/small_bams/MCL1.DG.15.sm.bam",
                       param=ScanBamParam(which=myreg), use.names = TRUE)
autoplot(bam, geom = "rect") + 
    xlim(GRanges("15", IRanges(6800000, 6900000)))
extracting information...

Like ggplot2, ggbio plots can be saved as objects that can later be modified, or combined together to form more complicated plots. If saved in this way, the plot will only be displayed on a plotting device when we query the object. This strategy is useful when we want to add a common element (such as an ideogram) to a plot composition and don’t want to repeat the code to generate the plot every time.

Challenge

Create tracks to compare the coverage of the gene Krt5 for the samples MCL1.DG, MCL1.DH, MCL1.LA and MCL1.LB

YD1wbWluKGAtbG9nMTAoRkRSKWAsIDUxKSkKCmdncGxvdChmaWx0VGFiLCBhZXMoeCA9IGxvZ0ZDLCB5PWAtbG9nMTAoRkRSKWApKSArIAogICAgZ2VvbV9wb2ludChhZXMoY29sb3VyPUZEUiA8IDAuMDUsIHNoYXBlID0gYC1sb2cxMChGRFIpYCA+IDUwKSwgc2l6ZT0yKQpgYGAKCiMjIyBTdHJpcCBjaGFydCBmb3IgZ2VuZSBleHByZXNzaW9uCgpCZWZvcmUgZm9sbG93aW5nIHVwIG9uIHRoZSBERSBnZW5lcyB3aXRoIGZ1cnRoZXIgbGFiIHdvcmssIGEgcmVjb21tZW5kZWQgKnNhbml0eQpjaGVjayogaXMgdG8gaGF2ZSBhIGxvb2sgYXQgdGhlIGV4cHJlc3Npb24gbGV2ZWxzIG9mIHRoZSBpbmRpdmlkdWFsIHNhbXBsZXMgZm9yIAp0aGUgZ2VuZXMgb2YgaW50ZXJlc3QuIFdlIGNhbiBxdWlja2x5IGxvb2sgYXQgZ3JvdXBlZCBleHByZXNzaW9uIHVzaW5nIApgc3RyaXBjaGFydGAuIFdlIGNhbiByZXRyaWV2ZSB0aGUgbm9ybWFsaXNlZCBleHByZXNzaW9uIHZhbHVlcyBpbiB0aGUgCmBkZHNPYmpgIG9iamVjdCB1c2luZyB0aGUgYGNvdW50c2AgZnVuY3Rpb24gZnJvbSBERVNlcTIuCgpgYGB7ciBnZW5lQ291bnRTdHJpcGNoYXJ0LCBmaWcud2lkdGg9NSwgZmlnLmhlaWdodD01LCBmaWcuYWxpZ249ImNlbnRlciJ9Cm5vcm1Db3VudHMgPC0gY291bnRzKGRkc09iaiwgbm9ybWFsaXplZD1UUlVFKSAlPiUgCiAgICBsb2cyKCkKCiMgTGV0J3MgbG9vayBhdCB0aGUgbW9zdCBzaWduaWZpY2FudGx5IGRpZmZlcmVudGlhbGx5IGV4cHJlc3NlZCBnZW5lOiBXYXAKdG9wZ2VuZSA8LSBmaWx0ZXIocmVzVGFiLCBTeW1ib2w9PSJXYXAiKQp0b3BnZW5lCgpncm91cHMgPC0gY29sRGF0YShkZHNPYmopJEdyb3VwCnBhcihtYXI9Yyg4LDQsMiwyKSkgI2FkanVzdCB0aGUgcGxvdCBtYXJnaW5zIHRoZSB4LWxhYmVscyBhcmUgdmlzaWJsZSAtIHNlZSA/cGFyCnN0cmlwY2hhcnQobm9ybUNvdW50c1siRU5TTVVTRzAwMDAwMDAwMzgxIixdfmdyb3VwcywKICAgICAgICAgICBjb2w9MTo2LAogICAgICAgICAgIHZlcnRpY2FsPVRSVUUsCiAgICAgICAgICAgcGNoPTIxLAogICAgICAgICAgIGxhcz0yLAogICAgICAgICAgIGNleD0yLAogICAgICAgICAgIHhsYWI9IiIsCiAgICAgICAgICAgeWxhYj0ibG9nMihDb3VudHMpIiwKICAgICAgICAgICBtYWluPSJOb3JtYWxpc2VkIENvdW50cyAtIFdhcCIpCmBgYAoKIyMjIEludGVyYWN0aXZlIFN0cmlwQ2hhcnQgd2l0aCBHbGltbWEKCkFuIGludGVyYWN0aXZlIHZlcnNpb24gb2YgdGhlIHZvbGNhbm8gcGxvdCBhYm92ZSB0aGF0IGluY2x1ZGVzIHRoZSByYXcgcGVyIApzYW1wbGUgdmFsdWVzIGluIGEgc2VwYXJhdGUgcGFuZWwgaXMgcG9zc2libGUgdmlhIHRoZSBgZ2xYWVBsb3RgIGZ1bmN0aW9uIGluIHRoZQoqR2xpbW1hKiBwYWNrYWdlLgoKCmBgYHtyfQpsaWJyYXJ5KEdsaW1tYSkKZ3JvdXAgPC0gYXMuZmFjdG9yKHNhbXBsZWluZm8kR3JvdXApCmxldmVscyhncm91cCkgPC0gYygiYmFzYWwubGFjdCIsImJh
c2FsLnByZWciLCJiYXNhbC52aXIiLAogICAgICAgICAgICAgICAgICAgImx1bS5sYWN0IiwgImx1bS5wcmVnIiwgImx1bS52aXIiKQphbm5vdC5tb2QgPC0gZmlsdFRhYlssYygiR2VuZUlEIiwgIlN5bWJvbCIsICJEZXNjcmlwdGlvbiIpXQpkZSA8LSBhcy5udW1lcmljKGZpbHRUYWIkRkRSPD0wLjA1KQpmaWx0Q291bnRzIDwtIG5vcm1Db3VudHNbZmlsdFRhYiRHZW5lSUQsXQpnbFhZUGxvdCh4PWZpbHRUYWIkbG9nRkMsIHk9LWxvZzEwKGZpbHRUYWIkRkRSKSwKICAgICAgICAgeGxhYj0ibG9nRkMiLCB5bGFiPSJGRFIiLCBtYWluPSJMYWN0YXRpbmcgdiBWaXJnaW4iLAogICAgICAgICBjb3VudHM9ZmlsdENvdW50cywgZ3JvdXBzPWdyb3VwLCBzdGF0dXM9ZGUsCiAgICAgICAgIGFubm89YW5ub3QubW9kLCBpZC5jb2x1bW49IkVOVFJFWklEIiwgZm9sZGVyPSJ2b2xjYW5vIikKYGBgCgpUaGlzIGZ1bmN0aW9uIGNyZWF0ZXMgYW4gaHRtbCBwYWdlICguL3ZvbGNhbm8vWFktUGxvdC5odG1sKSB3aXRoIGEgdm9sY2FubyBwbG90IApvbiB0aGUgbGVmdCBhbmQgYSBwbG90IHNob3dpbmcgdGhlIGxvZy1DUE0gcGVyIHNhbXBsZSBmb3IgYSBzZWxlY3RlZCBnZW5lIG9uIHRoZQpyaWdodC4gQSBzZWFyY2ggYmFyIGlzIGF2YWlsYWJsZSB0byBzZWFyY2ggZm9yIGdlbmVzIG9mIGludGVyZXN0LgoKCiMjIEFkZGl0aW9uYWwgTWF0ZXJpYWwKIyMjIFJldHJpZXZpbmcgRGV0YWlsZWQgR2Vub21pYyBMb2NhdGlvbnMKCgouIFRoZXJlIGlzIGEgd2hvbGUgc3VpdGUgb2YgYW5ub3RhdGlvbiBwYWNrYWdlcyB0aGF0IGNhbiBiZSB1c2VkIHRvIAphY2Nlc3MgdGhpcyBpbmZvcm1hdGlvbiwgYW5kIGZvciBwZXJmb3JtaW5nIG1vcmUtYWR2YW5jZWQgcXVlcmllcyB0aGF0IHJlbGF0ZSB0bwp0aGUgbG9jYXRpb24gb2YgZ2VuZXMuIFRoZXNlIGFyZSBsaXN0ZWQgb24gdGhlIEJpb2NvbmR1Y3RvciBbYW5ub3RhdGlvbiAKcGFnZV0oaHR0cDovL2Jpb2NvbmR1Y3Rvci5vcmcvcGFja2FnZXMvcmVsZWFzZS9CaW9jVmlld3MuaHRtbCNfX19Bbm5vdGF0aW9uRGF0YSkKYW5kIGhhdmUgdGhlIHByZWZpeCBgVHhEYi5gICh3aGVyZSAidHgiIGlzICJ0cmFuc2NyaXB0IikuIEluIGFkZGl0aW9uIHRoZXJlIGFyZSAKYSBsYXJnZSBudW1iZXIgb2YgcGFja2FnZXMgdGhhdCBtYWtlIHVzZSBvZiB0aGVzZSBhbm5vdGF0aW9ucyBmb3IgZG93bnN0cmVhbSAKYW5hbHlzZXMgYW5kIHZpc3VhbGlzYXRpb25zLiAKClVuZm9ydHVuYXRlbHksIHRoZXNlIHBhY2thZ2VzIGRvIG5vdCBjb3ZlciBhbGwgc3BlY2llcyBhbmQgdGVuZCBvbmx5IHRvIGJlCmF2YWlsYWJsZSBmb3IgVUNTQyBnZW5vbWVzLiBUaGFua2Z1bGx5LCB0aGVyZSBpcyBhIHdheSB0byBidWlsZCB5b3VyIG93biAKZGF0YWJhc2UgZnJvbSBlaXRoZXIgYSBHVEYgZmlsZSBvciBmcm9tIHZhcmlvdXMgb25saW5lIHJlc291cmNlcyBzdWNoIGFzIEJpb21hcnQKdXNpbmcgdGhlIHBhY2thZ2UKW2BHZW5vbWljRmVh
dHVyZXNgXShodHRwczovL2Jpb2NvbmR1Y3Rvci5vcmcvcGFja2FnZXMvcmVsZWFzZS9iaW9jL2h0bWwvR2Vub21pY0ZlYXR1cmVzLmh0bWwpLgoKYGBge3IgY3JlYXRlVHhEQiwgbWVzc2FnZT1GQUxTRX0KbGlicmFyeShHZW5vbWljRmVhdHVyZXMpCnR4TW0gPC0gbWFrZVR4RGJGcm9tQmlvbWFydChkYXRhc2V0PSJtbXVzY3VsdXNfZ2VuZV9lbnNlbWJsIikKYGBgCgpBY2Nlc3NpbmcgdGhlIGluZm9ybWF0aW9uIGluIHRoZXNlIFR4RGIgZGF0YWJhc2VzIGlzIHNpbWlsYXIgdG8gdGhlIHdheSBpbiB3aGljaAp3ZSBhY2Nlc3NlZCBpbmZvcm1hdGlvbiB1c2luZyBgYmlvbWFSdGAgZXhjZXB0IHRoYXQgYGZpbHRlcnNgICh0aGUgaW5mb3JtYXRpb24Kd2UgYXJlIGZpbHRlcmluZyBvbikgYXJlIG5vdyBjYWxsZWQgYGtleXNgIGFuZCBgYXR0cmlidXRlc2AgKHRoaW5ncyB3ZSB3YW50IHRvCnJldHJpZXZlKSBhcmUgYGNvbHVtbnNgLgoKRmlyc3Qgd2UgbmVlZCB0byBkZWNpZGUgd2hhdCBpbmZvcm1hdGlvbiB3ZSB3YW50LiBJbiBvcmRlciB0byBzZWUgd2hhdCB3ZSBjYW4gCmV4dHJhY3Qgd2UgY2FuIHJ1biB0aGUgYGNvbHVtbnNgIGZ1bmN0aW9uIG9uIHRoZSBhbm5vdGF0aW9uIGRhdGFiYXNlLgoKYGBge3IgY2hlY2tDb2x1bW5zfQpjb2x1bW5zKHR4TW0pCmBgYAoKV2UgYXJlIGdvaW5nIHRvIGZpbHRlciB0aGUgZGF0YWJhc2UgYnkgYSBrZXkgb3Igc2V0IG9mIGtleXMgaW4gb3JkZXIgdG8gZXh0cmFjdAp0aGUgaW5mb3JtYXRpb24gd2Ugd2FudC4gVmFsaWQgbmFtZXMgZm9yIHRoZSBrZXkgY2FuIGJlIHJldHJpZXZlZCB3aXRoIHRoZSAKYGtleXR5cGVzYCBmdW5jdGlvbi4KCmBgYHtyIGNoZWNrS2V5dHlwZXN9CmtleXR5cGVzKHR4TW0pCmBgYAoKVG8gZXh0cmFjdCBpbmZvcm1hdGlvbiB3ZSB1c2UgdGhlIGBzZWxlY3RgIGZ1bmN0aW9uLiBMZXQncyBnZXQgdHJhbnNjcmlwdCAKaW5mb3JtYXRpb24gZm9yIG91ciBtb3N0IGhpZ2hseSBkaWZmZXJlbnRpYWxseSBleHByZXNzZWQgZ2VuZS4KCmBgYHtyIHNlbGVjdH0Ka2V5TGlzdCA8LSBlbnNlbWJsQW5ub3QkR2VuZUlEW2Vuc2VtYmxBbm5vdCRTeW1ib2w9PSJXYXAiXQpzZWxlY3QodHhNbSwgCiAgICAgICBrZXlzPWtleUxpc3QsCiAgICAgICBrZXl0eXBlID0gIkdFTkVJRCIsCiAgICAgICBjb2x1bW5zPWMoIlRYTkFNRSIsICJUWENIUk9NIiwgIlRYU1RBUlQiLCAiVFhFTkQiLCAiVFhTVFJBTkQiLCAiVFhUWVBFIikKICAgICAgKQpgYGAKIAoKPiAjIyMgQ2hhbGxlbmdlIDIgey5jaGFsbGVuZ2V9Cj4KPiBVc2UgdGhlIHR4TW0gdG8gcmV0cmlldmUgdGhlIGV4b24gY29vcmRpbmF0ZXMgZm9yIHRoZSBnZW5lczogCiAgICArIGBFTlNNVVNHMDAwMDAwMjE2MDRgCiAgICArIGBFTlNNVVNHMDAwMDAwMjIxNDZgCiAgICArIGBFTlNNVVNHMDAwMDAwNDAxMThgIAo+CgpgYGB7ciBzb2x1dGlvbkNoYWxsZW5nZTIsIGVjaG89RkFMU0UsIHdhcm5pbmc9RkFMU0UsIG1lc3NhZ2U9RkFMU0V9
CgoKCgpgYGAKCiMjIE92ZXJ2aWV3IG9mIEdlbm9taWNSYW5nZXMKCk9uZSBvZiB0aGUgcmVhbCBzdHJlbmd0aHMgb2YgdGhlIGB0eGRiLi5gIGRhdGFiYXNlcyBpcyB0aGUgYWJpbGl0eSB0byBpbnRlcmZhY2UgCndpdGggYEdlbm9taWNSYW5nZXNgLCB3aGljaCBpcyB0aGUgb2JqZWN0IHR5cGUgdXNlZCB0aHJvdWdob3V0IEJpb2NvbmR1Y3RvciAKW3RvIG1hbmlwdWxhdGUgR2Vub21pYyAKSW50ZXJ2YWxzXShodHRwczovL3d3dy5uY2JpLm5sbS5uaWguZ292L3BtYy9hcnRpY2xlcy9QTUMzNzM4NDU4L3BkZi9wY2JpLjEwMDMxMTgucGRmKS4gCgpUaGVzZSBvYmplY3QgdHlwZXMgcGVybWl0IHVzIHRvIHBlcmZvcm0gY29tbW9uIG9wZXJhdGlvbnMgb24gaW50ZXJ2YWxzIHN1Y2ggYXMgCm92ZXJsYXBwaW5nIGFuZCBjb3VudGluZy4gV2UgY2FuIGRlZmluZSB0aGUgY2hyb21vc29tZSwgc3RhcnQgYW5kIGVuZCBwb3NpdGlvbiAKb2YgZWFjaCByZWdpb24gKGFsc28gc3RyYW5kIHRvbywgYnV0IG5vdCBzaG93biBoZXJlKS4KCmBgYHtyIHNpbXBsZUdSfQpsaWJyYXJ5KEdlbm9taWNSYW5nZXMpCnNpbXBsZV9yYW5nZSA8LSBHUmFuZ2VzKHNlcW5hbWVzID0gIjEiLCByYW5nZXMgPSBJUmFuZ2VzKHN0YXJ0PTEwMDAsIGVuZD0yMDAwKSkKc2ltcGxlX3JhbmdlCmBgYAoKV2UgZG9uJ3QgaGF2ZSB0byBoYXZlIGFsbCBvdXIgcmFuZ2VzIGxvY2F0ZWQgb24gdGhlIHNhbWUgY2hyb21vc29tZQoKYGBge3IgZ3JGb3JUaHJlZUdlbmVzfQpjaHJzIDwtIGMoIjEzIiwgIjE1IiwgIjUiKQpzdGFydCA8LSBjKDczMDAwMDAwLCA2ODAwMDAwLCAxNTAwMDAwMCkKZW5kIDwtIGMoNzQwMDAwMDAsIDY5MDAwMDAsIDE2MDAwMDAwKQoKbXlfcmFuZ2VzIDwtIEdSYW5nZXMoc2VxbmFtZXMgPSByZXAoY2hycywgMyksCiAgICAgICAgICAgICAgICAgICAgIHJhbmdlcyA9IElSYW5nZXMoc3RhcnQgPSByZXAoc3RhcnQsIGVhY2ggPSAzKSwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICBlbmQgPSByZXAoZW5kLCBlYWNoID0gMykpCiAgICAgICAgICAgICAgICAgICAgICkKbXlfcmFuZ2VzCmBgYAoKVGhlcmUgYXJlIGEgbnVtYmVyIG9mIHVzZWZ1bCBmdW5jdGlvbnMgZm9yIGNhbGN1bGF0aW5nIHByb3BlcnRpZXMgb2YgdGhlIGRhdGEgCihzdWNoIGFzICpjb3ZlcmFnZSogb3Igc29ydGluZykuIE5vdCBzbyBtdWNoIGZvciBSTkEtc2VxIGFuYWx5c2lzLCBidXQgCmBHZW5vbWljUmFuZ2VzYCBhcmUgdXNlZCB0aHJvdWdob3V0IEJpb2NvbmR1Y3RvciBmb3IgdGhlIGFuYWx5c2lzIG9mIE5HUyBkYXRhLiAKCkZvciBpbnN0YW5jZSwgd2UgY2FuIHF1aWNrbHkgaWRlbnRpZnkgb3ZlcmxhcHBpbmcgcmVnaW9ucyBiZXR3ZWVuIHR3byAKYEdlbm9taWNSYW5nZXNgLiAKCmBgYHtyIGZpbmRPdmVybGFwc30Ka2V5cyA8LSBjKCJFTlNNVVNHMDAwMDAwMjE2MDQiLCAiRU5TTVVTRzAwMDAwMDIyMTQ2IiwgIkVOU01VU0cwMDAwMDA0MDExOCIpCmdlbmVQ
b3MgPC0gc2VsZWN0KHR4TW0sCiAgICAgICAgICAgICAgICAgIGtleXMgPSBrZXlzLAogICAgICAgICAgICAgICAgICBrZXl0eXBlID0gIkdFTkVJRCIsCiAgICAgICAgICAgICAgICAgIGNvbHVtbnMgPSBjKCJFWE9OQ0hST00iLCAiRVhPTlNUQVJUIiwgIkVYT05FTkQiKQogICAgICAgICAgICAgICAgICApCgpnZW5lUmFuZ2VzIDwtIEdSYW5nZXMoZ2VuZVBvcyRFWE9OQ0hST00sIAogICAgICAgICAgICAgICAgICAgICAgcmFuZ2VzID0gSVJhbmdlcyhnZW5lUG9zJEVYT05TVEFSVCwgZ2VuZVBvcyRFWE9ORU5EKSwgCiAgICAgICAgICAgICAgICAgICAgICBHRU5FSUQgPSBnZW5lUG9zJEdFTkVJRCkKZ2VuZVJhbmdlcwoKZmluZE92ZXJsYXBzKG15X3JhbmdlcywgZ2VuZVJhbmdlcykKYGBgCgpIb3dldmVyLCB3ZSBoYXZlIHRvIHBheSBhdHRlbnRpb24gdG8gdGhlIG5hbWluZyBjb252ZW50aW9uIHVzZWQgZm9yIGVhY2ggb2JqZWN0LiAKYHNlcWxldmVsc1N0eWxlYCBjYW4gaGVscC4KCmBgYHtyIHNlcU5hbWluZ1N0eWxlfQpzZXFsZXZlbHNTdHlsZShzaW1wbGVfcmFuZ2UpCnNlcWxldmVsc1N0eWxlKG15X3JhbmdlcykKc2VxbGV2ZWxzU3R5bGUoZ2VuZVJhbmdlcykKYGBgCgoKIyMjIEV4cG9ydGluZyB0cmFja3MKCkl0IGlzIGFsc28gcG9zc2libGUgdG8gc2F2ZSB0aGUgcmVzdWx0cyBvZiBhIEJpb2NvbmR1Y3RvciBhbmFseXNpcyBpbiBhIGJyb3dzZXIgCnRvIGVuYWJsZSBpbnRlcmFjdGl2ZSBhbmFseXNpcyBhbmQgaW50ZWdyYXRpb24gd2l0aCBvdGhlciBkYXRhIHR5cGVzLCBvciBzaGFyaW5nIAp3aXRoIGNvbGxhYm9yYXRvcnMuIEZvciBpbnN0YW5jZSwgd2UgbWlnaHQgd2FudCBhIGJyb3dzZXIgdHJhY2sgdG8gaW5kaWNhdGUgCndoZXJlIG91ciBkaWZmZXJlbnRpYWxseS1leHByZXNzZWQgZ2VuZXMgYXJlIGxvY2F0ZWQuIFdlIHNoYWxsIHVzZSB0aGUgYGJlZGAgCmZvcm1hdCB0byBkaXNwbGF5IHRoZXNlIGxvY2F0aW9ucy4gV2Ugd2lsbCBhbm5vdGF0ZSB0aGUgcmFuZ2VzIHdpdGggaW5mb3JtYXRpb24gCmZyb20gb3VyIGFuYWx5c2lzIHN1Y2ggYXMgdGhlIGZvbGQtY2hhbmdlIGFuZCBzaWduaWZpY2FuY2UuCgpGaXJzdCB3ZSBjcmVhdGUgYSBkYXRhIGZyYW1lIGZvciBqdXN0IHRoZSBERSBnZW5lcy4KYGBge3IgdGFibGVPZkRFR2VuZXN9CnNpZ0dlbmVzIDwtIGZpbHRlcihyZXNUYWIsIEZEUiA8PSAwLjAxKQptZXNzYWdlKCJOdW1iZXIgb2Ygc2lnbmlmaWNhbnRseSBERSBnZW5lczogIiwgbnJvdyhzaWdHZW5lcykpCmhlYWQoc2lnR2VuZXMpCmBgYAoKIyMjIENyZWF0ZSBhIGdlbm9taWMgcmFuZ2VzIG9iamVjdAoKU2V2ZXJhbCBjb252ZW5pZW5jZSBmdW5jdGlvbnMgZXhpc3QgdG8gcmV0cmlldmUgdGhlIHN0cnVjdHVyZSBvZiBldmVyeSBnZW5lIGZyb20KYSBnaXZlbiBUeERiIG9iamVjdCBpbiBvbmUgbGlzdC4gVGhlIG91dHB1dCBvZiBgZXhvbnNCeWAgaXMgYSBsaXN0LCB3aGVyZSBlYWNoIAppdGVtIGluIHRo
ZSBsaXN0IGlzIHRoZSBleG9uIGNvLW9yZGluYXRlcyBvZiBhIHBhcnRpY3VsYXIgZ2VuZSwgaG93ZXZlciwgd2UgZG8gCm5vdCBuZWVkIHRoaXMgbGV2ZWwgb2YgZ3JhbnVsYXJpdHkgZm9yIHRoZSBiZWQgb3V0cHV0LCBzbyB3ZSB3aWxsIGNvbGxhcHNlIHRvIGEgCnNpbmdsZSByZWdpb24gZm9yIGVhY2ggZ2VuZS4gCgpGaXJzdCB3ZSB1c2UgdGhlIGByYW5nZWAgZnVuY3Rpb24gdG8gb2J0YWluIGEgc2luZ2xlIHJhbmdlIGZvciBldmVyeSBnZW5lIGFuZCAKdHJhbmZvcm0gdG8gYSBtb3JlIGNvbnZlbmllbnQgb2JqZWN0IHdpdGggYHVubGlzdGAuCgpgYGB7ciBnZXRHZW5lUmFuZ2VzfQpleG9SYW5nZXMgPC0gZXhvbnNCeSh0eE1tLCAiZ2VuZSIpICU+JSAKICAgIHJhbmdlKCkgJT4lIAogICAgdW5saXN0KCkKCnNpZ1JlZ2lvbnMgPC0gZXhvUmFuZ2VzW25hLm9taXQobWF0Y2goc2lnR2VuZXMkR2VuZUlELCBuYW1lcyhleG9SYW5nZXMpKSldCnNpZ1JlZ2lvbnMKYGBgCgpGb3IgdmlzdWFsaXNhdGlvbiBwdXJwb3Nlcywgd2UgYXJlIGdvaW5nIHRvIHJlc3RyaWN0IHRoZSBkYXRhIHRvIGdlbmVzIHRoYXQgYXJlIApsb2NhdGVkIG9uIGNocm9tb3NvbWVzIDEgdG8gMTkgYW5kIHRoZSBzZXggY2hyb21vc29tZXMuIFRoaXMgY2FuIGJlIGRvbmUgd2l0aCAKdGhlIGBrZWVwU2VxTGV2ZWxzYCBmdW5jdGlvbi4KCmBgYHtyIHRyaW1TZXF1ZW5jZXN9CnNlcWxldmVscyhzaWdSZWdpb25zKQpzaWdSZWdpb25zIDwtIGtlZXBTZXFsZXZlbHMoc2lnUmVnaW9ucywgCiAgICAgICAgICAgICAgICAgICAgICAgICAgICB2YWx1ZSA9IGMoMToxOSwiWCIsIlkiKSwKICAgICAgICAgICAgICAgICAgICAgICAgICAgIHBydW5pbmcubW9kZT0idGlkeSIpCnNlcWxldmVscyhzaWdSZWdpb25zKQpgYGAKCiMjIyBBZGQgbWV0YWRhdGEgdG8gR1JhbmdlcyBvYmplY3QKCkEgdXNlZnVsIHByb3Blcnkgb2YgR2Vub21pY1JhbmdlcyBpcyB0aGF0IHdlIGNhbiBhdHRhY2ggKm1ldGFkYXRhKiB0byBlYWNoIHJhbmdlCnVzaW5nIHRoZSBgbWNvbHNgIGZ1bmN0aW9uLiBUaGUgbWV0YWRhdGEgY2FuIGJlIHN1cHBsaWVkIGluIHRoZSBmb3JtIG9mIGEgZGF0YSAKZnJhbWUuCgpgYGB7ciBhZGRERVJlc3VsdHN9Cm1jb2xzKHNpZ1JlZ2lvbnMpIDwtIHNpZ0dlbmVzW21hdGNoKG5hbWVzKHNpZ1JlZ2lvbnMpLCBzaWdHZW5lcyRHZW5lSUQpLCBdCnNpZ1JlZ2lvbnMKYGBgCgojIyMgU2NvcmVzIGFuZCBjb2xvdXIgb24gZXhwb3J0ZWQgdHJhY2tzCgpUaGUgYC5iZWRgIGZpbGUgZm9ybWF0IGlzIGNvbW1vbmx5IHVzZWQgdG8gc3RvcmUgZ2Vub21pYyBsb2NhdGlvbnMgZm9yIGRpc3BsYXkgCmluIGdlbm9tZSBicm93c2VycyAoZS5nLiB0aGUgVUNTQyBicm93c2VyIG9yIElHVikgYXMgdHJhY2tzLiBSYXRoZXIgdGhhbiBqdXN0IApyZXByZXNlbnRpbmcgdGhlIGdlbm9taWMgbG9jYXRpb25zLCB0aGUgYC5iZWRgIGZvcm1hdCBpcyBhbHNvIGFibGUgdG8gY29sb3VyIAplYWNo
IHJhbmdlIGFjY29yZGluZyB0byBzb21lIHByb3BlcnR5IG9mIHRoZSBhbmFseXNpcyAoZS5nLiBkaXJlY3Rpb24gYW5kIAptYWduaXR1ZGUgb2YgY2hhbmdlKSB0byBoZWxwIGhpZ2hsaWdodCBwYXJ0aWN1bGFyIHJlZ2lvbnMgb2YgaW50ZXJlc3QuIEEgc2NvcmUKY2FuIGFsc28gYmUgZGlzcGxheWVkIHdoZW4gYSBwYXJ0aWN1bGFyIHJlZ2lvbiBpcyBjbGlja2VkLW9uLgoKRm9yIHRoZSBzY29yZSB3ZSBjYW4gdXNlIHRoZSAkLWxvZ197MTB9JCBvZiB0aGUgYWRqdXN0ZWQgcC12YWx1ZSBhbmQgCmNvbG91ciBzY2hlbWUgZm9yIHRoZSByZWdpb25zIGJhc2VkIG9uIHRoZSBmb2xkLWNoYW5nZQoKYGNvbG9yUmFtcFBhbGV0dGVgIGlzIGEgdXNlZnVsIGZ1bmN0aW9uIGluIGJhc2UgUiBmb3IgY29uc3RydWN0aW5nIGEgcGFsZXR0ZSBiZXR3ZWVuIHR3byBleHRyZW1lcy4gKipXaGVuIGNob29zaW5nIGNvbG91ciBwYWxldHRlcywgbWFrZSBzdXJlIHRoZXkgYXJlIGNvbG91ciBibGluZCBmcmllbmRseSoqLiBUaGUgcmVkIC8gZ3JlZW4gY29sb3VyIHNjaGVtZSB0cmFkaXRpb25hbGx5LWFwcGxpZWQgdG8gbWljcm9hcnJheXMgaXMgYSAqKipiYWQqKiogY2hvaWNlLgoKV2Ugd2lsbCBhbHNvIHRydW5jYXRlIHRoZSBmb2xkLWNoYW5nZXMgdG8gYmV0d2VlbiAtNSBhbmQgNSB0byBhbmQgZGl2aWRlIHRoaXMgcmFuZ2UgaW50byAxMCBlcXVhbCBiaW5zCgpgYGB7cn0KcmJQYWwgPC0gY29sb3JSYW1wUGFsZXR0ZShjKCJyZWQiLCAiYmx1ZSIpKQpsb2dGQyA8LSBwbWF4KHNpZ1JlZ2lvbnMkbG9nRkMsIC01KQpsb2dGQyA8LSBwbWluKGxvZ0ZDICwgNSkKCkNvbHMgPC0gcmJQYWwoMTApW2FzLm51bWVyaWMoY3V0KGxvZ0ZDLCBicmVha3MgPSAxMCkpXQpgYGAKClRoZSBjb2xvdXJzIGFuZCBzY29yZSBoYXZlIHRvIGJlIHNhdmVkIGluIHRoZSBHUmFuZ2VzIG9iamVjdCBhcyBgc2NvcmVgIGFuZCBgaXRlbVJnYmAgY29sdW1ucyByZXNwZWN0aXZlbHksIGFuZCB3aWxsIGJlIHVzZWQgdG8gY29uc3RydWN0IHRoZSBicm93c2VyIHRyYWNrLiBUaGUgcnRyYWNrbGF5ZXIgcGFja2FnZSBjYW4gYmUgdXNlZCB0byBpbXBvcnQgYW5kIGV4cG9ydCBicm93c2VycyB0cmFja3MuCgpOb3cgd2UgY2FuIGV4cG9ydCB0aGUgc2lnbmlmY2FudCByZXN1bHRzIGZyb20gdGhlIERFIGFuYWx5c2lzIGFzIGEgYC5iZWRgIHRyYWNrIHVzaW5nIGBydHJhY2tsYXllcmAuIFlvdSBjYW4gbG9hZCB0aGUgcmVzdWx0aW5nIGZpbGUgaW4gSUdWLCBpZiB5b3Ugd2lzaC4KCmBgYHtyfQptY29scyhzaWdSZWdpb25zKSRzY29yZSA8LSAtbG9nMTAoc2lnUmVnaW9ucyRGRFIpCm1jb2xzKHNpZ1JlZ2lvbnMpJGl0ZW1SZ2IgPC0gQ29scwpzaWdSZWdpb25zCgpsaWJyYXJ5KHJ0cmFja2xheWVyKQpleHBvcnQoc2lnUmVnaW9ucyAsIGNvbiA9ICJyZXN1bHRzL3RvcEhpdHMuYmVkIikKYGBgCgojIyBFeHRyYWN0aW5nIFJlYWRzCgpBcyB3ZSBoYXZlIGJlZW4gdXNpbmcgY291bnRzIGFz
IG91ciBzdGFydGluZyBwb2ludCwgd2UgaGF2ZW4ndCBpbnZlc3RpZ2F0ZWQgdGhlIGFsaWduZWQgcmVhZHMgZnJvbSBvdXIgZXhwZXJpbWVudCwgYW5kIGhvdyB0aGV5IGFyZSByZXByZXNlbnRlZC4gQXMgeW91IG1heSBiZSBhd2FyZSwgYWxpZ25lZCByZWFkcyBhcmUgdXN1YWxseSBzdG9yZWQgaW4gYSAqYmFtKiBmaWxlIHRoYXQgY2FuIGJlIG1hbmlwdWxhdGVkIHdpdGggb3Blbi1zb3VyY2UgY29tbWFuZC1saW5lIHRvb2xzIHN1Y2ggYXMgWypzYW10b29scypdKGh0dHA6Ly93d3cuaHRzbGliLm9yZy8pIGFuZCBbKnBpY2FyZCpdKGh0dHBzOi8vYnJvYWRpbnN0aXR1dGUuZ2l0aHViLmlvL3BpY2FyZC8pLiBCaW9jb25kdWN0b3IgcHJvdmlkZSBhIGxvdy1sZXZlbCBpbnRlcmZhY2UgdG8gZGF0YS9iYW0vc2FtIGZpbGVzIGluIHRoZSBmb3JtIG9mIHRoZSBgUnNhbXRvb2xzYCBwYWNrYWdlLiBUaGUgYEdlbm9taWNBbGlnbm1lbnRzYCBwYWNrYWdlIGNhbiBhbHNvIGJlIHVzZWQgdG8gcmV0cmlldmUgdGhlIHJlYWRzIG1hcHBpbmcgdG8gYSBwYXJ0aWN1bGFyIGdlbm9taWMgcmVnaW9uIGluIGFuIGVmZmljaWVudCBtYW5uZXIuCgpgYGB7ciBtZXNzYWdlPUZBTFNFfQpsaWJyYXJ5KEdlbm9taWNBbGlnbm1lbnRzKQpgYGAKCkluIHRoZSBkaXJlY3RvcnkgYHNtYWxsX2JhbXNgIHRoZXJlIHNob3VsZCBiZSBgLmJhbWAgZmlsZXMgZm9yIHNvbWUgb2YgdGhlIHNhbXBsZXMgaW4gdGhlIGV4YW1wbGUgc3R1ZHkuIFRoZSB3b3JrZmxvdyB0byBwcm9kdWNlIHRoZXNlIGZpbGVzIGlzIGRlc2NyaWJlZCBpbiBhIFtzdXBwbG1lbnRhcnkgcGFnZV0oLi4vU3VwcGxlbWVudGFyeV9NYXRlcmlhbHMvUzFfR2V0dGluZ19yYXdfcmVhZHNfZnJvbV9TUkEubmIuaHRtbCkgZm9yIHRoZSBjb3Vyc2UuIEluIGJyaWVmLCB0aGUgcmF3IHJlYWRzIChgZmFzdHFgKSB3ZXJlIGRvd25sb2FkZWQgZnJvbSB0aGUgU2hvcnQgUmVhZCBBcmNoaXZlIChTUkEpIGFuZCBhbGlnbmVkIHdpdGggYGhpc2F0MmAuIEVhY2ggYmFtIGZpbGUgd2FzIG5hbWVkIGFjY29yZGluZyB0byB0aGUgZmlsZSBuYW1lIGluIFNSQSwgYnV0IHdlIGhhdmUgcmVuYW1lZCB0aGUgZmlsZXMgYWNjb3JkaW5nIHRvIHRoZWlyIG5hbWUgaW4gdGhlIHN0dWR5LiBBbiBpbmRleCBmaWxlIChgLmJhaWApIGhhcyBiZWVuIGdlbmVyYXRlZCBmb3IgZWFjaCBiYW0gZmlsZS4gSW4gb3JkZXIgdG8gcmVkdWNlIHRoZSBzaXplLCB0aGUgYmFtIGZpbGVzIHVzZWQgaGVyZSBvbmx5IGNvbnRhaW4gYSBzdWJzZXQgb2YgdGhlIHJlYWRzIHRoYXQgd2VyZSBhbGlnbmVkIGluIHRoZSByZWdpb24gY2hyMTU6MTAxNzA3MDAwLTEwMTcxMzAwMC4KCgpgYGB7cn0KbGlzdC5maWxlcygiY291bnRzL3NtYWxsX2JhbXMvIikKYGBgCgpUaGUgYHJlYWRHQWxpZ25tZW50c2AgZnVuY3Rpb24gcHJvdmlkZXMgYSBzaW1wbGUgaW50ZXJmYWNlIHRvIGludGVycm9nYXRlIHRoZSBhbGlnbmVkIHJlYWRzIGZvciBhIHBhcnRpY3Vs
YXIgc2FtcGxlLiBJdCBjYW4gYWxzbyB1dGlsaXNlIHRoZSAqaW5kZXgqIGZpbGUgaW4gb3JkZXIgdG8gcmV0cmlldmUgb25seSB0aGUgcmVhZHMgdGhhdCBjb3JyZXNwb25kIHRvIGEgc3BlY2lmaWMgcmVnaW9uIGluIGFuIGVmZmljaWVudCBtYW5uZXIuIFRoZSBvdXRwdXQgaW5jbHVkZXMgdGhlIGdlbm9taWMgbG9jYXRpb24gb2YgZWFjaCBhbGlnbmVkIHJlYWQgYW5kIHRoZSBDSUdBUiAoKipDKipvbXBhY3QgKipJKipkaW9zeW5jcmF0aWMgKipHKiphcHBlZCAqKkEqKmxpZ25tZW50ICoqUioqZXBvcnQpOyB3aGVyZSAqTSogZGVub3RlcyBhbiBtYXRjaCB0byB0aGUgZ2Vub21lIGFuZCAqSSosICpEKiBjb3JyZXNwb25kIHRvIGluc2VydGlvbnMgYW5kIGRlbGV0aW9ucy4KCmBgYHtyfQpleG8gPC0gZXhvbnNCeSh0eE1tLCAiZ2VuZSIpIApnZW5lcmVnaW9uIDwtIGV4b1tbIkVOU01VU0cwMDAwMDAyMjE0NiJdXSAlPiUgCiAgICBrZWVwU2VxbGV2ZWxzKHZhbHVlID0gMTUsIHBydW5pbmcubW9kZT0idGlkeSIpCgpteS5yZWFkcyA8LSByZWFkR0FsaWdubWVudHMoZmlsZT0iY291bnRzL3NtYWxsX2JhbXMvTUNMMS5ERy4xNS5zbS5iYW0iLAogICAgICAgICAgICAgICAgICAgICAgIHBhcmFtPVNjYW5CYW1QYXJhbSh3aGljaD1nZW5lcmVnaW9uKSkKbXkucmVhZHMKYGBgCgpJdCBpcyBwb3NzaWJsZSB0byB0d2VhayB0aGUgZnVuY3Rpb24gdG8gcmV0cmlldmUgb3RoZXIgcG90ZW50aWFsbHktdXNlZnVsIGluZm9ybWF0aW9uIGZyb20gdGhlIGJhbSBmaWxlLCBzdWNoIGFzIHRoZSBtYXBwaW5nIHF1YWxpdHkgYW5kIGZsYWcuCgpgYGB7cn0KbXkucmVhZHMgPC0gcmVhZEdBbGlnbm1lbnRzKGZpbGU9ImNvdW50cy9zbWFsbF9iYW1zL01DTDEuREcuMTUuc20uYmFtIiwKICAgICAgICAgICAgICAgICAgICAgICBwYXJhbT1TY2FuQmFtUGFyYW0od2hpY2g9Z2VuZXJlZ2lvbiwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgd2hhdD1jKCJzZXEiLCJtYXBxIiwiZmxhZyIpKSkKbXkucmVhZHMKYGBgCgpUaGUgZmxhZyBjYW4gcmVwcmVzZW50IHVzZWZ1bCBRQyBpbmZvcm1hdGlvbi4gZS5nLgoKICArIFJlYWQgaXMgdW5tYXBwZWQKICArIFJlYWQgaXMgcGFpcmVkIC8gdW5wYWlyZWQKICArIFJlYWQgZmFpbGVkIFFDCiAgKyBSZWFkIGlzIGEgUENSIGR1cGxpY2F0ZSAoc2VlIGxhdGVyKQoKVGhlIGNvbWJpbmF0aW9uIG9mIGFueSBvZiB0aGVzZSBwcm9wZXJ0aWVzIGlzIHVzZWQgdG8gZGVyaXZlIGEgbnVtZXJpYyB2YWx1ZSwgYXMgaWxsdXN0cmF0ZWQgaW4gdGhpcyB1c2VmdWwgW3Jlc291cmNlXShodHRwczovL2Jyb2FkaW5zdGl0dXRlLmdpdGh1Yi5pby9waWNhcmQvZXhwbGFpbi1mbGFncy5odG1sKQoKUGFydGljdWxhciBhdHRyaWJ1dGVzIG9mIHRoZSByZWFkcyBjYW4gYmUgZXh0cmFjdGVkIGFuZCB2aXN1YWxpc2VkCgpgYGB7cn0KaGlzdChtY29scyhteS5yZWFkcykkbWFwcSwgbWFpbj0iIiwgeGxhYj0iTUFQ
USIpCmBgYAoKSG93ZXZlciwgdGhlcmUgYXJlIG1vcmUtc29waGlzdGljYXRlZCB2aXN1YWxpc2F0aW9uIG9wdGlvbnMgZm9yIGFsaWduZWQgcmVhZHMgYW5kIHJhbmdlIGRhdGEuIFdlIHdpbGwgdXNlIHRoZSBgZ2diaW9gIHBhY2thZ2UsIHdoaWNoIGZpcnN0IHJlcXVpcmVzIHNvbWUgZGlzY3Vzc2lvbiBvZiB0aGUgYGdncGxvdDJgIHBsb3R0aW5nIHBhY2thZ2UuCgoKIyMgQ29tcG9zaW5nIHBsb3RzIHdpdGggZ2diaW8KCldlIHdpbGwgbm93IHRha2UgYSBicmllZiBsb29rIGF0IG9uZSBvZiB0aGUgdmlzdWFsaXNhdGlvbiBwYWNrYWdlcyBpbiBCaW9jb25kdWN0b3IgdGhhdCB0YWtlcyBhZHZhbnRhZ2UKb2YgdGhlIEdlbm9taWNSYW5nZXMgYW5kIEdlbm9taWNGZWF0dXJlcyBvYmplY3QtdHlwZXMuIEluIHRoaXMgc2VjdGlvbiB3ZSB3aWxsIHNob3cgYSB3b3JrZWQKZXhhbXBsZSBvZiBob3cgdG8gY29tYmluZSBzZXZlcmFsIHR5cGVzIG9mIGdlbm9taWMgZGF0YSBvbiB0aGUgc2FtZSBwbG90LiBUaGUgZG9jdW1lbnRhdGlvbiBmb3IKZ2diaW8gaXMgdmVyeSBleHRlbnNpdmUgYW5kIGNvbnRhaW5zIGxvdHMgb2YgZXhhbXBsZXMuCgpodHRwOi8vd3d3LnRlbmdmZWkubmFtZS9nZ2Jpby9kb2NzLwoKVGhlIGBHdml6YCBwYWNrYWdlIGlzIGFub3RoZXIgQmlvY29uZHVjdG9yIHBhY2thZ2UgdGhhdCBzcGVjaWFsaXNpbmcgaW4gZ2Vub21pYyB2aXN1YWxpc2F0aW9ucywgYnV0IHdlCndpbGwgbm90IGV4cGxvcmUgdGhpcyBwYWNrYWdlIGluIHRoZSBjb3Vyc2UuCgpUaGUgTWFuaGF0dGFuIHBsb3QgaXMgYSBjb21tb24gd2F5IG9mIHZpc3VhbGlzaW5nIGdlbm9tZS13aWRlIHJlc3VsdHMsIGVzcGVjaWFsbHkgd2hlbiBvbmUgaXMgY29uY2VybmVkIHdpdGggdGhlIHJlc3VsdHMgb2YgYSBHV0FTIHN0dWR5IGFuZCBpZGVudGlmeWluZyBzdHJvbmdseS1hc3NvY2lhdGVkIGhpdHMuIAoKVGhlIHByb2ZpbGUgaXMgc3VwcG9zZWQgdG8gcmVzZW1ibGUgdGhlIE1hbmhhdHRhbiBza3lsaW5lIHdpdGggcGFydGljdWxhciBza3lzY3JhcGVycyB0b3dlcmluZyBhYm91dCB0aGUgbG93ZXIgbGV2ZWwgYnVpbGRpbmdzLgoKIVtdKGh0dHBzOi8vdXBsb2FkLndpa2ltZWRpYS5vcmcvd2lraXBlZGlhL2NvbW1vbnMvMS8xMi9NYW5oYXR0YW5fUGxvdC5wbmcpCgpUaGlzIHR5cGUgb2YgcGxvdCBpcyBpbXBsZW1lbnRlZCBhcyB0aGUgYHBsb3RHcmFuZExpbmVhcmAgZnVuY3Rpb24uIFdlIGhhdmUgdG8gc3VwcGx5IGEgdmFsdWUgdG8gZGlzcGxheSBvbiB0aGUgeS1heGlzIHVzaW5nIHRoZSBgYWVzYCBmdW5jdGlvbiwKd2hpY2ggaXMgaW5oZXJpdGVkIGZyb20gZ2dwbG90Mi4gVGhlIHBvc2l0aW9uaW5nIG9mIHBvaW50cyBvbiB0aGUgeC1heGlzIGlzIGhhbmRsZWQgYXV0b21hdGljYWxseSBieQpnZ2JpbywgdXNpbmcgdGhlIHJhbmdlcyBpbmZvcm1hdGlvbiB0byBnZXQgdGhlIGdlbm9taWMgY29vcmRpbmF0ZXMgb2YgdGhlIHJhbmdlcyBvZiBpbnRlcmVz
dC4KClRvIHN0b3AgdGhlIHBsb3RzIGZyb20gYmVpbmcgdG9vIGNsdXR0ZXJlZCB3ZSB3aWxsIGNvbnNpZGVyIHRoZSB0b3AgMjAwIGdlbmVzIG9ubHkuCgpgYGB7cixmaWcud2lkdGg9MTIsZmlnLmhlaWdodD01fQpsaWJyYXJ5KGdnYmlvKQp0b3AyMDAgPC0gc2lnUmVnaW9uc1tvcmRlcihzaWdSZWdpb25zJEZEUilbMToyMDBdXQoKcGxvdEdyYW5kTGluZWFyKHRvcDIwMCAsIGFlcyh5ID0gbG9nRkMpKQoKYGBgCgpgZ2diaW9gIGhhcyBhbHRlcm5hdGVkIHRoZSBjb2xvdXJzIG9mIHRoZSBjaHJvbW9zb21lcy4gSG93ZXZlciwgYW4gYXBwZWFsaW5nIGZlYXR1cmUgb2YgYGdncGxvdDJgIGlzIHRoZSBhYmlsaXR5IHRvIG1hcCBwcm9wZXJ0aWVzIG9mIHlvdXIgcGxvdCB0byB2YXJpYWJsZXMgcHJlc2VudCBpbiB5b3VyIGRhdGEuIEZvciBleGFtcGxlLCB3ZSBjb3VsZCBjcmVhdGUgYSB2YXJpYWJsZSB0byBkaXN0aW5ndWlzaCBiZXR3ZWVuIHVwLSBhbmQgZG93bi1yZWd1bGF0ZWQgZ2VuZXMuIFRoZSB2YXJpYWJsZXMgdXNlZCBmb3IgYWVzdGhldGljIG1hcHBpbmcgbXVzdCBiZSBwcmVzZW50IGluIHRoZSBgbWNvbHNgIHNlY3Rpb24gb2YgeW91ciByYW5nZXMgb2JqZWN0LgoKYGBge3IsZmlnLndpZHRoPTEyLGZpZy5oZWlnaHQ9NX0KbWNvbHModG9wMjAwKSRVcFJlZ3VsYXRlZCA8LSBtY29scyh0b3AyMDApJGxvZ0ZDID4gMAoKcGxvdEdyYW5kTGluZWFyKHRvcDIwMCwgYWVzKHkgPSBsb2dGQywgY29sID0gVXBSZWd1bGF0ZWQpKQpgYGAKCmBwbG90R3JhbmRMaW5lYXJgIGlzIGEgc3BlY2lhbCBmdW5jdGlvbiBpbiBgZ2diaW9gIHdpdGggcHJlc2V0IG9wdGlvbnMgZm9yIHRoZSBtYW5oYXR0YW4gc3R5bGUgb2YgcGxvdC4gTW9yZSBvZnRlbiwgdXNlcnMgd2lsbCBjYWxsIHRoZSBgYXV0b3Bsb3RgIGZ1bmN0aW9uIGFuZCBgZ2diaW9gIHdpbGwgY2hvb3NlIHRoZSBtb3N0IGFwcHJvcHJpYXRlIGxheW91dC4gT25lIHN1Y2ggbGF5b3V0IGlzIHRoZSAqa2FyeW9ncmFtKi4gCgpgYGB7cixmaWcud2lkdGg9MTIsZmlnLmhlaWdodD01fQoKYXV0b3Bsb3QodG9wMjAwLCBsYXlvdXQ9ImthcnlvZ3JhbSIsIGFlcyhjb2xvcj1VcFJlZ3VsYXRlZCwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgZmlsbD1VcFJlZ3VsYXRlZCkpCgpgYGAKCgoKYGdnYmlvYCBpcyBhbHNvIGFibGUgdG8gcGxvdCB0aGUgc3RydWN0dXJlIG9mIGdlbmVzIGFjY29yZGluZyB0byBhIHBhcnRpY3VsYXIgbW9kZWwgcmVwcmVzZW50ZWQgYnkgYSBgR2Vub21pY0ZlYXR1cmVzYCBvYmplY3QsIHN1Y2ggYXMgdGhlIG9iamVjdCB3ZSBjcmVhdGVkIGVhcmxpZXIgd2l0aCB0aGUgZXhvbiBjb29yZGluYXRlcyBmb3IgZWFjaCBnZW5lIGluIHRoZSBHUkNtMzggZ2Vub21lLgoKCmBgYHtyfQphdXRvcGxvdCh0eE1tLCB3aGljaD1leG9bWyJFTlNNVVNHMDAwMDAwMjIxNDYiXV0pCmBgYAoKV2UgY2FuIGV2ZW4gcGxvdCB0aGUgbG9jYXRpb24gb2Ygc2VxdWVu
Y2luZyByZWFkcyBpZiB0aGV5IGhhdmUgYmVlbiBpbXBvcnRlZCB1c2luZyByZWFkR0FsaWdubWVudHMgZnVuY3Rpb24gKG9yIHNpbWlsYXIpLgoKYGBge3J9Cm15cmVnIDwtIGV4b1tbIkVOU01VU0cwMDAwMDAyMjE0NiJdXSAlPiUgCiAgICBHZW5vbWljUmFuZ2VzOjpyZWR1Y2UoKSAlPiUgCiAgICBmbGFuayh3aWR0aCA9IDEwMDAsIGJvdGggPSBUKSAlPiUgCiAgICBrZWVwU2VxbGV2ZWxzKHZhbHVlID0gMTUsIHBydW5pbmcubW9kZT0idGlkeSIpCgpiYW0gPC0gcmVhZEdhcHBlZFJlYWRzKGZpbGU9ImNvdW50cy9zbWFsbF9iYW1zL01DTDEuREcuMTUuc20uYmFtIiwKICAgICAgICAgICAgICAgICAgICAgICBwYXJhbT1TY2FuQmFtUGFyYW0od2hpY2g9bXlyZWcpLCB1c2UubmFtZXMgPSBUUlVFKQoKYXV0b3Bsb3QoYmFtLCBnZW9tID0gInJlY3QiKSArIAogICAgeGxpbShHUmFuZ2VzKCIxNSIsIElSYW5nZXMoNjgwMDAwMCwgNjkwMDAwMCkpKQpgYGAKCkxpa2UgZ2dwbG90MiwgZ2diaW8gcGxvdHMgY2FuIGJlIHNhdmVkIGFzIG9iamVjdHMgdGhhdCBjYW4gbGF0ZXIgYmUgbW9kaWZpZWQsIG9yIGNvbWJpbmVkIHRvZ2V0aGVyIHRvCmZvcm0gbW9yZSBjb21wbGljYXRlZCBwbG90cy4gSWYgc2F2ZWQgaW4gdGhpcyB3YXksIHRoZSBwbG90IHdpbGwgb25seSBiZSBkaXNwbGF5ZWQgb24gYSBwbG90dGluZyBkZXZpY2UKd2hlbiB3ZSBxdWVyeSB0aGUgb2JqZWN0LiBUaGlzIHN0cmF0ZWd5IGlzIHVzZWZ1bCB3aGVuIHdlIHdhbnQgdG8gYWRkIGEgY29tbW9uIGVsZW1lbnQgKHN1Y2ggYXMKYW4gaWRlb2dyYW0pIHRvIGEgcGxvdCBjb21wb3NpdGlvbiBhbmQgZG9u4oCZdCB3YW50IHRvIHJlcGVhdCB0aGUgY29kZSB0byBnZW5lcmF0ZSB0aGUgcGxvdCBldmVyeSB0aW1lLgoKYGBge3IsIG1lc3NhZ2U9RkFMU0V9CmdlbmVNb2QgPC0gYXV0b3Bsb3QodHhNbSwgd2hpY2ggPSBteXJlZykgICsgCiAgICB4bGltKEdSYW5nZXMoIjE1IiwgSVJhbmdlcyg2ODEwMDAwLCA2ODgwMDAwKSkpCnJlYWRzLk1DTDEuREcgPC0gYXV0b3Bsb3QoYmFtLCBzdGF0ID0gImNvdmVyYWdlIikgICsgCiAgICB4bGltKEdSYW5nZXMoIjE1IiwgSVJhbmdlcyg2ODEwMDAwLCA2ODgwMDAwKSkpICsKICAgIGxhYnModGl0bGU9Ik1DTDEuREciKQp0cmFja3MoR1JDbTM4PWdlbmVNb2QsIE1DTDEuREc9cmVhZHMuTUNMMS5ERyApCmBgYAoKPiAjIyBDaGFsbGVuZ2Ugey5jaGFsbGVuZ2V9Cj4KPiBDcmVhdGUgdHJhY2tzIHRvIGNvbXBhcmUgdGhlIGNvdmVyYWdlIG9mIHRoZSBnZW5lIEtydDUgZm9yIHRoZSBzYW1wbGVzIE1DTDEuREcsIE1DTDEuREgsIE1DTDEuTEEgYW5kIE1DTDEuTEIKPgoKYGBge3IsZWNobz1GQUxTRSxmaWcuaGVpZ2h0PTUsZmlnLndpZHRoPTEwfQoKCmBgYAoK